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PREFACE 

The mathematical problems which arise in dealing with 
numerical data are attractive and important. Some know¬ 
ledge regarding them is required by workers in many differ¬ 
ent fields—astronomers, meteorologists, physicists, engineers, 
naval architects, actuaries, biometricians, and statisticians, 
as well as pure mathematicians; but until recently there 
has been very little instruction on the subject in the 
mathematical departments of the British universitiea Of 
late, however, there has been a great awakening of interest 
in it; and it is now included in the syllabus for the Open 
Competitive Examination for appointments in the Home and 
Indian Civil Services, the Colonial Service, etc. 

The present volume represents courses of lectures given 
at different times during the years 1913-1923 by Professor 
Whittaker to undergraduate and graduate students in the 
Mathematical Laboratory of the University of Edinburgh, and 
may be regarded as a manual of the teaching and practice 
of the Laboratory, complete save for the subject of Descriptive 
Geometry, for which a separate text-book is desirable. The 
manuscript of the lectures has been prepared for publication by 
Mr. Bobinson, who has performed the whole of the work of 
numerical verification and has contributed additional examples. 

The expositibn has been designed to make each chapter, 
as far as possible, intelligil)le to those who have not mastered 
the preceding chapters; so that any one who is interested in 
some special problem may, by the help of the Table of Contents, 

V 
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find and understand what is said about it without being obliged 
to read from the beginning of the book. 

One feature which perhaps calls for a word of explanation 
is the prominence given to arithmetical, as distinguished from 
graphical, methods. When the Edinburgh Laboratory was 
established in 1913, a trial was made, as far as possible, of 
every method which had been proposed for the splution of the 
problems under consideration, and many of these methods were 
graphical. During the ten years which have elapsed since 
then, the graphical methods have almost all been abandoned, as 
their inferiority has become evident, and at the present time 
the work of the Laboratory is almost exclusively arithmeticaL 
A rough sketch on squared paper is often useful, but (except in 
Descriptive Geometry) graphical work performed carefully with 
instruments on a drawing-board is generally less rapid and less 
accurate than the arithmetical solution of the same problem. 

The material equipment essential for a student’s mathe¬ 
matical laboratory is very simple. Each student should have 
■ a copy of Barlow’s tables of squares, etc., a copy of Crelle’s 
“ Calculating Tables,” and a seven-place table of logarithms. 
Further, it is necessary to provide a stock of computing paper 
(i.e. paper ruled into squares by rulings a quarter of an inch 
apart; each square is intended to hold two digits; the rulings 
should be very faint, so as not to catch the eye more than 
is necessary to guide the alignment of the calculation), and 
lastly, a stock of computing forms for practical Fourier analysis 
(those used in Chapter X. of this book may be purchased). 
With this modest apparatus nearly all the computations here¬ 
after described may be performed, although time and labour 
may often be saved by the. use of multiplying and adding 
machines when these are available. 

Attention may bQ drawn to the oppesrtunities which the 
subject presents to the research workW in Mathematics. There 
is an evident need for new and improved methods of dealing 
'with many of the problems discussed in the later chapters. * 
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CHAPTER I 


INTEEPOLATION WITH EQUAL INTEBVALS OF THE ARGUMENT 

J^'-if^ntroduction.—Mathematics is occupied largely with the 
idea of correspond ence : e,g, to every number x there corre¬ 
sponds a value of x^, thus 

flj = 1, 2, 3, 4, 5, • • • 
aj2=:l, 4, 9,16, 25, . . . 

One of the two variables between which correspondence 
holds is called the argument and the other is called function 
of that a rgunient. 

If a function y of an argument ^ is defined by an equation 
y=f{x)f where /(a?) is an algebraical expression involving only 
arithmetical operations such as squaring, dividing, etc., then by 
performing ttiese operations we can find accurately the value 
of y, which corresponds to any value of x. But if .y = logioi» 
(say), it is not possible to calculate y by performing simple 
arithmetical operations on x (at any rate it is not possible to 
calculate y accurately by performing a finite number of such 
operations), and we are compelled to have recourse to a table, 
which gives the values of y corresponding to certain selected 
values of x\ e.g. 



log*. 

X. 

logo?. 

7-0 


7-4 

0-869 232 

7-1 

0-851 258 

7-5 

0-875 061 

7-2 

0-857 338 

7-6 

0-880 814 

7*3 

0-863 323 

7-7 

0-886 491 

The question then arises as 

to how we can find the values 


of the function log x for values of the argument x which* are 
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intermediate between the tabulated values, e.g. such a value as 
7*152. The answer to this question is furnished by the 
^theory of Tntp.rjinlatia*rk^ which in its most elementary aspect 
[may be described ^ the science of “reading be tween the lines 
of a mathem atical table.” 

In the furthS^evelopment of the theory of interpolation 
it will be shown how to find the differential coefficient of a 
function which is specified by a table, and also to find its 
integral taken between any bounds of integration. 

A kind of interpolation was used by Briggs,* but interpolation of 
the kind hereafter explained, based on the rt^presentation of functions 
by polynomials, was first introduced by James Gregory f in 1670. 

Difference Tables.—Suppose a function f{u) is given in 
a table for the values a, a + w, a-\-2w, ... of its 

argument u. It is required to find the value of the function 
when the argument has the value g+ay . where a? is a fraction. 

Before this problem can be solved by the method of inter¬ 
polation, it is first necessary to form what are called th e differ- 
CTUies of the tabular values. The quantity 

'f(a + w) -f(a) 

is denoted by A/(a) and is called the first differeme of /(a). 
The first difference of is which is 

denoted by A/(a+«<;). Moreover, the quantity 
A/(a + te?)~A/(a) 

is denoted by Ay(a) and is called the second difference of /(a), 
while the quantity 

b?f(a+w)-t?f{a) 

is denoted by Ay(a) and is called the third difference of^o;),, - 
and so on. 

It is convenient to arrange the tabular values and thehf 
differences for increasing values of the argument in what is' 
called a difterenee tab le, as follows: 

♦ Briggs* method was', however, closely related to the modeni central- 
difference formulae. Gf. his Arithmeiiea LogaHthmieai ch. xiii., and hie 
THgonwriMtHa Briiawnica, ch. xii. Cf. Journal of the Institute, of Aetuai;fe$, 
14 , pp. 1, 78^ 84, 88; U, p. 312. * 

t Bigaud's Oorrespondenee of Seientijie Men of the 17th Century^ 3, p. 303^ 
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Ay(?) 

Ay(a + w) 
^y{a + 2w) 


irArjumtfid, Entry, A. A®. A®. 

a /(«) ' 

A/(a) / 

^ J Ay(a) ■ _ 

' A/(a.+«>) Ay(a) 

a + <'/(» +* * Ayi(a + ?(?)-v- 

A/(as-2wi/, Ay(a + w?) 

a + /(tt + 3vj)^v ‘V l!^^£{a-Y2v)y‘^) 

A/(a + 3^k ^ ^ A¥(a + 22^) 

a + /(a + 4?/j) V • Ay(a + 3?«)* 

and similarly for differences of order higher than the third. 
The first entry f [a) is called the leading term , and the differences 
of /(tt), that is to say A/(a), . . . are called \hi d%adina 

d iffermc^ s. Evidently e ach def erence in thela ile i s t hejium ber ‘ 
(with its proper aUjehraic sign) / o Uaincd h y sUbtroLctinq^e 
number £ mmedi(i telg above -and to JLlie left from the number 
immediately belo iv and to the lef t 

The sum of the entries in any colimiii of differences is equal to the 
difference between the first and last entries of the preceding column. 
This affords a Numerical check on the accuracy of the table. Thus in 
the above table we have . 

Ay(a + 3tt’) = Ay(tt) + Ay(a) + Ay(a + '«?) + Ay(a + 2tr). J 
An example of a difference table is the following, which represents 
the natural sines of angles from 25® 40' O'' to 25® 43' 0" inclusive at 
intervals of 20". 


Argwinmi, 
25® 40' 0" 

yT' Entry. 

0-43313 47858 66963 

A. A*. 

8-73933 Q6476- 

A» 

20" 

0-43322 21791.72439 

- 40.73056 




8 73892 3242Q' 

822 

40" 

0-43330 95684 04859 

- 40 73878 

8 73861 6864?' - 

822 

26®41'0" 

0-43339 69535 63401 

- 40 74700 




8 73810 83842.' 

820 

"‘ 20 " 

0-43348 43346 47243 

-40 76520 


i 


8 73770 08322 

823 

40" 

0-43367 17116 56565 

- 40 76343 

8 73729 31979 

821 

26' 42' 0' 

0-43365 90846 87544 

-40 77164 



. 

8 73688 64815 

821 

20* 

0-43374 64634 42369 

- 40 77986 

8 73647 76830 

822 

• 40* 

26' 43'0* 

0-43383 38182 19189 

0-43392 11789 17212 

- 40 78807 

8 73606 98023 
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It will be seen that in this case the third dliferences are practically 
constant when quantities beyond the fifteenth place are neglected, any 
departure from constancy in the last place being really due to the 
neglect of the sixteenth place of decimals in the original entries. So 
the fourth differences are zero. 

It will be found that in the case of practically all tabular 
functions the differences of a certain order are all zero; or, to 
speak more accurately, they are smaller than one unit in the last 
decimal place retained in the tables in question. This fact lies 
^t the basis of the method of interpolation, as we shall now see. 
A. 3. Symbolic Operators.—The formiilae of the calculij g^ 
^flferences may be very simply represented by the use of what 
are called symbolic operators. Of these we have already intro¬ 
duced A, and we shall now consider another operator denoted 
by E. 

Let w represent the interval between successive values of the 
argument of the function f{a), and let E denote the operation 
of increasing the argument by w,qo that E/(a)=/(a4-^); in 
general we shall write E®/'(a)=/(a + a;v3), where a? is an integer. 
JNow by definition we had A f(a + xw) = f{a + + ^) - f{a + xw), 
BO ^if{a + xw) = {E- l)f{a^xw). It is therefore evident that 
the operators E and A are connected ly the relation A = E -1 or 

E^l + A.'^ 

When symbolic operators obey the ordinary laSKfl,x>f Algebra 
they may be separated from the symbols representing tEe 
functions to which they refer and treated independently in 
much the same way as symbols of quantity. Now it may be 
easily showm that the following relations are true for the 
operator A: j 

M/ia) +A^) + A/tc) + • • •. 

^cf{a) = ktif(a), where A: is a constant factor, 
A’"A“/(®)=A”*+“/(®)» where m, n are positive 
integers. 

The corresponding identities for E are: / 

E{/(a)+/(6)+/(<') + . . .} = E/(a)^+Ej^6) + E/(c) + . . 

E^a)=iE/(a), 

=E”*+»^a). 

Thus in many respects the operators E and A behave li&e 
a^braic symbols and may be combined like them. 
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The following examples illustrate the use of these operators: 

Ex. 1. — To express the nth differences of a tahulated function in terms 
of the successive entries. 

4»/(«) = (E-!)"/(«) 

= + . . . +(-!)»}/(«), 

2! 

i.e. 

=/(a + nw) - nf{a + nw - v?) + -\- nw - 2ic) -. . . 

Ex. 2.— To express the function f{a-\-xw) in terms of f(a) and the 
successive differences of f{a\ vdien x is a positive interjcr. 


so that 


/(tf.+ .;«•) = E-/W 

= (l+A)^/(a), 

/((H-x«»)=/(a) + arA/((') + -“-jJ^A*/(o) + . . .+A»/((i). 


y 


The Differences of a Poljmomial.—We tind without 
difficulty that the difference talde for the function is 

as follows; 


x. 

y- 

A. 

A« 

A» 

A*. 

0 

0 

t- 




1 

1 

7 

6 

6 


2 

8‘ r 

19 

12 

6 

0 

3 

2%. 

37 

18 

6 

0 

4,.„ 

, , . 64 

61 

24 

6 

0 

9^ 

0 

125 


30 


© 



91 


4 

er~ 

6 

216 





It will be seen that the tmrd 

differences of this function are 


rigorously constant and the fourth differences are zero. Tins i8_ 
a ^)articular case of a general property which we shal ljtipgL. 

establisiL 7^6- 
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^ Note jfeBit the table may be extended indefinitely when we know the 
thud differences to be constant. For by definition, when we add to an ^ 
e ntry in a column of diffe r ences the correspondinfy first difference, th e 
s um so formeH ffivpg in the hnbimn. It follows that the 

‘Colunin ofsecond differences can be formed from the leading term 6 by 
repeatedly adding the constant third difference 6; the column of first 
differences being formed from the leading term 1 by adding in succession 
the second differences 6, 12, 18,. . . The values of are then obtained 
from the leading term Q by adding in succession the first differences 
1, 7, 19, 37, 61, . . . I 

Consider the c^ise when the tabulated function f{x) is a 
polynomial of degree n, say, 

y(ic)==Aa5’* + Ba;®“^ + Ca5«”*+. . . + Laj+M. / 

Then 

A/(«) =/(«+“’) -A«) - 

. .+Lw. 

Now 

I (a+^•)”=~ ^+. . .+«?", 

80 that 

A/(a)=A{TOwa""^+^^^Y—+ . . . + «;*} 

+ . . . 

+lao. 

This is a polynomial of degree {n - 1) in a, and therefore the 
first differences of a polynomial represent another polynomial 
of degree less by one unit. 

By repeated application of this result we see that 

the 2nd differences represent a polynomial of degree »- 2, 

» 3rd „ )f ff ff 71 


^ . ff ftth. ff ff ff ff Of 

».«. the nth differences are constant. It follows, therefore, that 
the (n + differences of a polynomial of the nth degree are 
stero. * 

^7^. The Differences of Zero. —A table of values of any power 
^ natural numbers may be formed by simple addition whSn 
term and the leading differences are known, in 
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precisely the same way as in forming the table of cubes (§ 4). 
The differences of the leading term 0^^, which are generally used 
in forming a table of are known as the differences of zero. 
They are of frequent occurrence in the calculus of differences. 

In order to form a table of reference of the differences of zero we 
apply the result of § 3 (Ex. 1), 

A"/(a) =/(a + nw) - nf(a + nw -w)-h ^n{n - !)/(« + nw - . . . 

and write 

A“ic^^ = (a: + n)i*-n{^c^-(n-+ l){y; + (n-2)J?^-. . , 

If we now substitute in this equation particular values for jr, p, and w, we 
obtain the equations 

AwOi’ = nV - n{n - \)P + |n(rt - 1) (u - 2)i> -. . . ± n.lP + 0i\ 
l == l)P f |(?i - 1) (91 - 2)l> -. . . ± lP-\ 

and therefore A’^0^^ = 9iA" ‘ ^ li>" \ (1) 

Eroiu the relation A“^y(a + tr) = A”/(tt) 4- A*‘“y(u) we see that 
A”“iy^”i = A"0^^~i +A*^~i 0 i* 7 ', and equation (1) may be written 

A«0i^ = n(A»»0i»"i ^ /^n-iop-i^ (2) 

We now construct a table of values of by the lepeated applica* 
tiuii of this equation, remembering that A®0^ = 0, A^O^ = 1, and also 
that A^0P = 0 for 9i>p. 



AOr. 

A*0r. 

A»0^\ 

A^Or. 

A*0^. 

A«0^. 

1 

1 






2 

1 

2 





3 

1 

6 

6 




4 

1 

14 

36 

24 



5 

1 

30 

150 

240 

120 


6 

1 

62 

540 

1560 

1800 

720 

7 

If 

126 

1806 

8400 

16800 

15120 

8 

254 

5796 

40824 

126000 

191520 

9 

I 

510 

18150 

186480 

834120 

1905120 

10 

1 

1022 

55980 

818520 

5103000 

16435440 


From equation (2) w’c see that the value of a particular difference 
Aw is obtained by taking n times the sum of the two numbers of the 
preceding row which arc situated in the same column and in the preced¬ 
ing column respectively. For example, 

A80’=: 3(62+ 640) 

y .= 1806 . 

6^ The Differences of x(x-l)(x-2) . . . (x-p + 1).— 

Among the polynomials of degree p there is one polynomial of 
social interest in the theory of interpolation, namely, 

( b (® - 1 ) (® - 2 ) . . . 7 (» -p + 1 ). 
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p^ltexnial is denoted by [»]!’ and is called a faetorud.' If' 
suppoM the interval of the argument in the difference table 
”dt [as]*’ to be unity, we have 

[aj’=a(a-l)(a-2) . . . (a-p + 1), 

[a + l]*’«=(a+l)a(a-l) (a-2) . . . (a-p+2), 

AEa]!*=[a + Ijp - [a]** 

=a(o-l)(a-2)(a-3)... («-p + 2){(a + l)-(a-p+l)} 

=p[a]P-\ 

so that A[a;]i’=p[a!]P-i.* 

It follows that 

A[xlp ^ [a;y-i [x-{-l]P [xy ^]P“i 
p\ (jJ-1)!’^^ p\ ^ j?! i)!’ 

a result that may now be used to tabulate the values of [x]P/p ! as in the 
following table: 


X, 

MV2!. 

W/3!. 

W/4!. 

M*/5! 

0 





1 

0 




2 

1 

0 



3 

3 

1 

0 


4 

6 

4 

1 

0 

6 

10 

10 

5 

1 

6 

15 

20 

16 

6 

7 

21 

35 

35 

21 

8 

28 

56 

70 

56 

9 

36 

84 

126 

126 


/ 7. The Representation of a Polynomial by Factorials.— 

In § 4 we found an expression for A/(a;), the first difference 
of a polynomial of degree w, in a form which is less simple 
than the polynomial itself. It is more convenient to carry out 
the operation of differencing by the use of factorials, using the 
relation of § 6: 

A[xY=f[x\P-\ ( 1 ) 

* 

, Let denote a polynomial in x of degree k. "We may 
imfe where r is the remainder and 

the quotient when ^t{x) is divided by (®-m+ifc), so 
of degree (A-1). By a repeated application of t^s 





INTEEPOLATION ' 4 ' 

transformation, we obtain an expression for a polynemial of the "• 
nth degree in terms of factorials: 

= o + ^[a;] + [a5]V«-2(«) 

=a+^[a:] + 7[a:p+[a:]®./,,, _ j(i») 

• • • • tf 

=a + /3[a;] 4-y[a:p + . . . + [a;]'*<^o(»). 

where a, p,y, . . . are constants and is a constant v (say). 
We thus obtain the result 

<tin{(c) = a + P[x]+y[xf + S[xf + . . . + v[aj]" (2) 

JSi,—To represent the function y = o:4—l 2ic® + 42^2 — 30a; + 9 and its 
successive differences in the factorial notation. 

Using detached coeflicieiits when dividing by a*, a; — 1, a; — 2, , •. 

1 1 - 12 + 42-30 9 

Ojf 1 -;^11 ji31_ 

2 i-ll+31 1 

0+ 2-18 

3 1-"9T'13 
0+ 3 I 

we obtain the value of y in the f< rnt 

y = [,rf - G[af + 13[a;]2 + [x] + 9. 

The successive differences are given by 

Ay = 4[:rf _ 18[af + 26[a:] + 1, 

12[a*]2-36[.r]+26, 

A^y = 24[a;] — 36, 

A% = 24. 

Now let a be one of the tabulated values of the argument 
of a polynomial of degree n, and let w be the interval between 
successive ♦values of the argument. Consider the value 
f{a+xw) of the polynomial corresponding to the value {a-^-xw) 
of the argument. Writing f{a+xw) for in (2) and 

applying the operation denoted by equation (1) to both sides o! 
equation (2), we find that 

u A^a+a^)«j8 + 2y[a?p + 35[a?p + . . . + nv[x]^^\ (3) 

* Chryatal, Algelra^ 1, p. 108. 
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iBififerenping this equation, we obtain 

: A^a== 2y + 2.3S[aj]i + 3.4e[a5]*+. . . + n{n - l)v[aj]""®. (4) 
Moreover, 


A^a+a:«^) = 2.3.8+ 2.3.4€[a;]i + 3.4.5^[a?p-f. . . 

+ n{n-l) {n-2)v[x\^~^, ( 6 ) 

and so on for differences of higher order. The values of the 
coefi&cients a, y, . . . are found by putting a?==0 in each of 
the equations (2), (3), (4), ... so that 

r = S=iA»/(«), . . . v^A^f(a)ln\. 

Equation (2) may now be written 


+ 


x(x~-l){x-2) , . . (a;--?i + l)^^ 


' This formula enaUes us to express the polynomial f{a+xw) 
in terms of the factorials x, x{x -1), x{x -1) (a; - 2), . . . when a 
difference table of the function is given. 


This general fornmla may be easily verified for special values of x. 
When a = 0, it becomes /(a) ==/(«). 

When »= 1, then 


f{a + w) ^f{a) + l.Af(a) 

=/(a)+- {f{a-i-w)’-j[{a)}, which is an identity. 

When 3! = 2, 

/(a + 2w) =f{a) + 2 A/(a) + A^fia) 

=/(a) + 2{/(a + w;)-/(a)} 

+ {f{a + 2w) - 2f{a + lo) +/(«)}. 


k', 


The Oregory-Newton Formula of Interpolation. —The 

general formula of the last section may be applied to solve the 
problem of interpolation. 

Suppose that y is a function of an argument u and that the 
values of y given in the table are f{a), f{a+w), f{a-^2w), 
f(a + 3w), . . . corresponding to the values a, a+w, a-\-2w, 
a+3wj... of Also suppose that th^se values of the function 
are entered in a difference aud that the differences of 
n are constant. We are^noCilupposed to know the values 
of y which correspond to other values of u, such as + 

* Of. Ex. 2, § 8. 


'V 
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It is required to find an analytical expression for these 
mediate values of y. 

The problem may be stated graphically as follows: 

Draw the rectangular axes On, Oy. Let K, L, M, N. . 
be points on the u axis having abscissae a, a-\'W, + aif^, 

. . . respectively. At these points erect ordinates KA, LB, 
MO, ND, . . . equal respectively to the entries f{ci)i + 
f{a + 2w), f{a + Ziv), . . . Then the points Ay B, (7, j9,. . • so 
determined are points on the graph of the function.* The 
problem of finding a smooth " curve to pass through the points 
A, B, O, has not a unique solution: in fact an infinite 

number of curvets satisfying these conditions can be found. As 
our aim is a practical one, we naturally choose the simplest, 
soluthm of our probbnn. y 
^ Remembering that the 
simplest functions are poly- 
I nomials, W(^ inquire if it is 
possil)lo to pass through the 
points Ay B, Cy , . , a curve 
whicli is the, gTaph of a 2 ^oly^ 
nomial function (jf dt'gree n. 

We have already seen 
(§ 4) tliat for any polynomiaj 
of degree n the dilferences of order n are constant and for the 
set of values/(^/),/(ry,-f-?£’),/(«+ 2^r), ... it has been assumed 
that the dillerences of order n are constant. This being so, 
a polynomial of degree n exists wliicli takes the values f[a), 
f{ii-\-w)y f{a + 2w)y . . . when the argument u has the values 
a, a^Wy a + 2w, . . .; in fact, by the last section, we can write 
down an expression for the polynomial. It is 

^=/(a)+a;A/(a)+^2^U*/(a)+. . . 



Fio. 1. 






* We do not know anything about the portions of the graph intermediate 
between those points, but wo assume that the graph is a smooth cwrv€\ for 
our prejont purpose we can take this to mean that tlie function has ilhittf , 
differeptial coefficients of all orders at every point. 
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^ where x is connected with u by the relation 
and where 


A/(a) stands for/*(fl^ + w) - f{a), 

A^^a) stands for f^a + 2‘w) - 2f{(i + w)-\-f{a), 
and so on. 

We shall noiv take the polynomial (1) to represent the function 
y also for values of the argument intermediate hetween the 
tabulated values. The portions of tlio graph intermediate 
between the points A, B, C, . . . may therefore be filled in by 
drawing the curve 

y=^f{a + xu^ 

+ . . . ( 2 ) 

and in order to compute the value of y corresponding to any 
intermediate value of the argument sucli as a-\>\u\ we*, simply 
substitute the value x- \ in this formula,* wliich is the analyti¬ 
cal expression required. 

The fundamental problem of interpolation is tlius solved. 
The formula (1) is often referred to as NeiotorCs formula of 
interpolation, although it was discovered by James Gregory in 

levo.t 

The application of the Gregory-Newtoii formula is illustrated by the 
following exanqdes: 


* Many books of logarithmic tables, etc., contain a table of the binomial 
coefficients required in the interpolation formula (1), at intervals of 0*01 from 
aj=0 to 0 ^= 1 . 

t Cf. a letter of Gregory to Collins of date November 23, 16/0, printed in 
Rigaud’s C(yrrespondc'Hce, 2, p. 209. An example of the use of the formula is 
worked out on p. 211 of Rigaud. Collins was accustomed to send on to Newton 
the mathematical discoveries of Gregory (cf. Rigaud, 2, p. 835). 

Newton’s publications on interpolation are contained in : 

1. The Metliodus Diffcrentialis published in 1711 but written before 

October 1676. 

2. A letter written in 1676 to John Smith. 

3. Lemma v. in Book iii. of the Principia publis^icd in 1687. The above 

formula is Case i. 

4. Various references in the Commercium Epistolicum of dates 1672/3 to 1676, 
.These have been collected and edited by I). C. Fraser in the Journal of th^ 

pMIlUvie of Actuaries, 51 (1918-19), pp. 77 and 211. 
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ISx. 1.— From the table given below to find the entry corresponding to 
. x=21. 

Argument, liutig. A. A’. A®. <S*. 

20 0-2203 uonsats 

7017472)7 

22 0-230016702195 002297 . 

702340544 * - 1944 

24 0-230719052030 600353 4 

702949897 -1940 

26 0-231422001030 598413 3- 

703548310 -1037 

28 0-232125550240 596470 

704144786 

30 0-232820605032 

Hci-e rt==20, U- — 2, /(a + .nr) anil:>: = J. 

/(21) =/(20) -|. .vA/(20) + 7 ^^A2/(20) + +. . . 

1-4 - O 1 

= 220314055248 + (701747247) - J (602297) - 
= 229314055248 ^ _ f 76287-1 

+ 350873623-5 ' \ +321-5 

= 22066582887)-5 - 75408-6 

BO 

/(21) = 0-220665753463. 

. 

2 .— To jhul flui co-onhnate X of the sun- on Novemhcr 10, 1910, at 
4^ .‘{O’” O.M.T. (X i.i the true (jeoe.entric co-ordinate measured on a line 
jnissing throwjh the true equinoj'. of the date). 

Tlio Nautical Almanac gives the following readings from which we 
construct a diilenuice table : 


1010. 

-X. 

A. 

A®. 

A3. 

Noveml)er 9-0 

0-G850997 

- G3809 



9*5 

0-G787188 

- 64323 

-514 

4 

10-0 

0*0722865 

-64833 

-510 

7 

10*5 

0*6658032 

- 65336 

-503 

2 

11*0 

0*6592696 

- 65837 

-501 


11-5 

0*6526859 





* VVe must interpolate for 4^ 30® from No^'cmbe^ 10-0. The interval is 
12^. Then 4^ 30®, as a fraction of the argument, gives x = 0*375. ^ 
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log »= 9-6740313 
log (x - 1)= 9-7968800(n), 

where (») indicates that 9-7958800 is the logarithm of a negative nnmhei 

log 1 = 9-0989700 
log lx(x - 1) = 9-0088813(n) 
log ■J = 9-6228787 
log (a - 2) = 0-2108534(n) 
log ^x(x - 1) (» - 2) = 8-8026134. 

Also 

log (- 64833) = 4-8117961(n.) log (- 503) = 2-701 r)C80(»i) 

log a:= 9-5740313 log - 1) = 9-0C88813(ji) 

log (-64833a!) = 4-3858274(n) log la:(a: - 1)( - 603) = 1-7704493 

= log(-24312-4) =log 58-94 

log 2 = 0-3010300 
log \x{x - !)(» - 2) = 8-8026134 
log Jx(x-l)(a:-2)(2) = 9-1036434 = log 0-1. 

Therefore - X = 0-67228650 - 0-00243124 + 0-00000589, 
and finally -X = 0-6698612. 

» 9. An Alternative Form of the Gregory - Newton 
''Formula. —The Gregory-Newton formula may be written in 
ah alternative form which is convenient when an arithmometer * 
is used. Rearranging the formula of tlie last section in the form 

and assuming the dilferenees of order n to be constant, 'we may 
replace the Gregory-Newton formula by 

j\a + xv})=f(a)+xu^, ( 1 ) 

where =A/(«) - ^ (1 - *)%, 


• • • • 

w,j=Ay(a), which is constant.)^ 

When computing a value of the function by this method, we 
Ijpgin with the constant difference and calculate in succession 
the values of • • •, fina}ly substituting the value 

of in equation (ij. The following example will serve as an 
illustration of this method: 

* When an aiithmometer is not available CrelU^s Calculaiing Tables 'will be 
^fotud useful for this purpose. 
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Ex.—To find f{6) when 0 = 24/46980 09207 020, haviny given 


e. 

/{«)■ 



A. 

A2. 

A». 

A\ 

24*4 

0-216 198 

561 

343 










168 

272 307 




L 24 ^ 

0-216 366 

833 

650 



745 715 







169 

018 022 


768 


24-6 

0-216 5.35 

851 

672 



746 483 


6 





169 

764 505 


773 


24-7 

0-216 705 

616 

177 



747 256 


4 





170 

511 761 


777 


24-8 

0-216 876 

127 

938 



748 033 


5 





171 

259 794 


782 


24-9 

0-2)7 047 

387 

732 



748 815 







172 

008 609 




25-0 

0-217 219 

396 

341 






Hon', 

■)() = 0-1, 

a - 

= 2.r-4, 

1 ‘ 

= 0-698 005 207 02. 



Hence 


4(3 

-x)Ay(«)=7 

68-0-5: 

r6 X 5 




= 765-1, 









K2 = ^*/(‘0- 

4(2 


= 745 ' 

715 -0-4.34 Ox 765-1 




= 745 ;i83-0, 








= A/(a) - 

■ 

- x)H2 = 

-168 2 

72 307 « 

• 0-150 997 4 X 

: 745 383 


= 168 ir>i) TOC-l. 

Then 

/(a + xir) =/(«) fxHj 

= 0-216 1!)« 561 31;5 + 0-698 005 207 02 x 0-000 168 159 766 
= 0-216 198 561 343 
+ 117 376 386, 

^6r /(0)'= 0-216 315 937 728. 

/’ JlO. The Binomial Theorem, —T5y use of the operator E, we 
can write the Gregory-Newtou interpolation formula in the form 

= {1 + a-A+A* +.U(o). 

WHcn thus written, tlic formula is seen to he the same as that 
obtained by expanding (1 + A)^ by the Binomial Theorem in 
ascending powers of A and then operating on f{a) with the 
terms of the series so formed, i.e. 

The Binomial Theorem was made known (in correspondence) 
six years after the Gr(jgory-Newton formula; in fact, Newton 
seems to have discovered the Binomial Theorem by forming the 
expansions of (1 +aj)" directly for integral values of n, and then 
^ writing down the powers of x in these expansions. In the 
case of the coeflicient of x^ he would have: 
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Coefficient of 

A. 

A*. 

0 

0 

0 


1 

0 





1 


2 

1 

2 

1 

3 

3 


1 



3 


4 

6 

4 

1 

5 

10 




whence evidently the coefficient is of the second degree in n. 
Since it vanishes when 7?. = 0 and also when 7i = l, it must 


contain the factors n and [n ~ 1); and, since the coefficient has 


the value 1 when 

n = 2, it is 

n{n -1) 

2 ’ 



We may remark 

that if we form a difference table for (1 

+ x)^ thus : 

Argument, 

Kntrg, 

A. 

A®. 

A«. 

0 

1 

X 



1 

(i+# 


x^ 




a:(l + x) 


a-® 

2 

(l+af 


a;2(l + x) 




a(l +xf 


x3(l+a;) 

3 

(!+»•)» 


a;2(l +# 


then on substituting 

■ the values /(O) ~ 1, A/(O) 

= « . . , in 

the Gregory- 


Newton formula 

/(w) =/(0) + nA/(0) + \n{n - l)A2/(0) +. . . 
we obtain (1 + a;)’* = 1 + tijk + — — — a;^ +. . . 

which is the binomial expausioii. 


Examples on Chapter I 


1 . 


(a) 


Form the difference tables corresponding to the following entries: 


6, log tan B, 


26 ' 10 ' 0 * 

9-691 380 858 103 01 

10 ' 

434 054 052 28 

20 ' 

487 246 020 72 

30 ' 

540 434 009 42 

. 40 ' 

► 693 618 019 47 

60 ' 

646 798 051 97 

26 ' 11 ' 0 ' 

9-691 6*99 974 108 01 

10 ' 

753 146 188 70 

20 ' 

806 314 295 11 

30 ' 

859 478 428 36 
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X. 

sin X. 

28" 40' OO'^ 

0-479 713 113 250 246 

10" 

755 651 470 168 

20" 

798 188 562 452 

30" 

840 ^24 526 998 

40" 

883 259 363 705 

50" 

925 793 072 474 

o 

o 

o 

00 

968 325 653 205 

10" 

0-480 010 857 105 798 


2. If 1 /= - j:“ + 3a; + 1, calculate^, the values of y corresjmiiding to 

a; = 0, 1, 2, 3, 4, 5, and form the talde of differences. Prove tlieoretically 
that the second difference is 12a;-4-10 and vcirify this numerically. 

3. Find the function whose first difference is the function 

Uij'^ + /5a;^ + yx + S. 

4. Find the successive differences of 

(a) 1/a;, the interval being unity, 

(b) vxjs na*, the interval being w. 

5. Exi^ress f(ic) ~ 3;/;^ + x -f* 1 in the form 

ax(x - 1 ) (.c - 2 ) + fir{x - 1 ) + yx + 8 

by comparing coefficients. Calculate the values of/(a;) for x = 0,1 , 2, 3, 4, 6, 
etc., and form a difference table. Verify the equation 

M =/(0) + .':A/(0) + ’'a2/(0) 4 -^-?^AS/(0). 

6. Compute the thiixl difference of /(hi) by the formula of § 3, Ex. 1, 
from the following table of entiies : 

X 51 52 53 54 

f(x) 132651 140608 148877 157464 

verifying the result by means of a difference table. 

7. Given the table of values 

.a; - 3 -2 ~1 0 1 

y 16 7 4 1 -8 

find by means of the Gregory-Newton formula an expivssion for y as & 
function of x. 

8. Construct a difference table having given 

log 6-980 = 0-776 701184 0 
log 6-981 = 0-776 773 802 4 
log 5-982 = 0-776 846 408 7 
log 6-983 = 0-776 919 002 8 
log 6-984 = 0-776 991 684 9 

and determine log 5-9805. 

9. Let p, q, r, s be successive entries in a table corresponding to 
Squidistant arguments. 

Show that when third differences are taken into account the entry 

(d8U) * 



18 THE CALCULUS OF OBSEEVATIONS 

corresponding to the argument half-way between the arguments of g 
and r is 

+ ■ (IXMo-S-O 

10. Let p, g, r, s be successive entries (corresponding to equidistant 
arguments) in a table. It is required to interpose 3 entries (corre¬ 
sponding to equidistant arguments) between q and r, using third differ¬ 
ences. Show that this may be done as follows: 

Between q and r interpose 3 arithmetical means A, B, and C; also 
between ^q — 2p-s and 3r - 2s — p interpose 3 means A', B', and C'. 
Then the 3 terms required are A +-/oA', B + C 4 

(De Morgan.) 

11. Determine log 6*0405, liaving given 

log 6*040 = 0*7810369386 
log 6*041 = 0*7811088357 
log 6*042 = 0*7811807209 
log 6*043 = 0*7812525942 
log 6.044 = 0*7813244557 

12. Using the inetliod of § 9, find sin 24“*4698006207, having given 
the values 


0. 

ain 6. 

24*25 

0-410718852614 

24*50 

0-414693242656 

24*76 

0-418669737537 

25*00 

0-422618261741 

25*25 

0-426568739902 

25*50 

0-430611096808 

the values 


X, 


0 

858-313740095 

1 

869-646772308 

2 

880-975826766 

3 

892-303904683 

4 

903-630006876 


calculate/(I*5) by the Gregory-Newton formula. 

14. The values of a function corresponding to the values 1, 2, 3, 4, 5 
of the argument are 0*198669, 0*237702, 0*276355, 0*314666, 
0*352274 respectively. Calculate the values of the function when the 
argument has the values 1*26 and 1*75 respectively. 

15. Using the difference table given in § 2, find the values of 
sin 25° 40' 10" and sin 25° 40' 30". Also verify the answers 

sin 26” 40' 60' = 0-433 353*^61 493 41C, 
sin 26° 41' 10'= 0-433 440 644 614 711, 
sin 26° 41' 30' = 0-433 628 023 660 896, 
sin 26° 41'60'= 0-433 616 398 631 149, 
dbttined by taking x numerically less than unity in the formula of § 8. 
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16. Calculate log tail 24“ O' 5", given the values 

log tan 24“ O' 0* = 9-648 683 137 400 95 
log tan 24“ 0' 20* = 9-648 696 457 723 08 
log tan 24“ 0' 40* = 9-648 809 758 267 66 
log tan 24“ 1' 0* = 9-648 923 039 045 83 
log tan 24“ 1' 20* - 9-649 036 300 068 75 
log tan 24“ 1' 40* = 9-649 149 541 347 57. 


17. The following table gives the values of !{?•) — 




/(■'■). 

0-00 

0-886 226 92 

0*01 

0-8Y6 227 24 

0-02 

0-866 229 57 

0*03 

0-856 235 90 

0-04 

0-846 248 22 

0*05 

0*836 268 53 


Calculate I{if) for 'x — 0-025 by interpolation anti verify your result hy 
use of the formula 

S.7 





CHAPTER n 


INTBEPOLATION WITH UNEQUAL INTERVALS OP THE ARGUMENT 




>^1. Divided. Differences. —We have so far assumed that the 
values of the argument proceed by equal steps; but with data 
derived from observation it is not always possiljle to complete a 
difference table in this way. For example, when astronomical 
observations are disturbed by clouds there are gaps in the 
records. 

Consider the case in wliicli tlie values of the argument, for 
which the function is known, arc unequally spaced, and suppose 
that the values of f{x) are known for x^a^, 

. . ., a? an, where the intervals - a^y ag - aj, , ., 

an~an-i need not be equal. In place of ordinary differences 
we now introduce what are known as divided differences,^ Let 
UB form in succession the quantities 

-a-, -A'H. « 0 , 

and so on. These are called divided differences of the first order. 
Moreover, let us form 






These are called divided differences of the second order. Also let 

«a. «i) -y(«2. «i. «o)}/K - «o) =‘A<h> «a. «i. ®o)- 
This is called a divided difference of the third order. The 
divided differences of higher orders are formed in the same way, 
so that the order of a divided difference is less by imity thsm the 
^S^ber of arguments required for its deiinition. 

* Divided differences might fairly be ascribed to Kewtonv Lemma v. The 
tem was used first by De Morgan, Diff, and Ini, Calc. (1842), \ 650, and after- 
'waw by Oppermann, Joum, Inst. Ad, 16 (1869), p. 146. i&m^re, Awn, de. 
^gannfif 26 (1826),:^. 829, used the name inter^olaiory fundiwk, 
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Divided differences may be expressed more symmetrically ■ 
as follom: 


«o) 


A^o). ^ A^h) 

a^-Oi a^-a^’ 


*i> *o) 

=_.l _./ A"2 ) , /(“i) 1 + 1 C ^i)_., A%) 1 

^ _ A'^o) _ + A<^i) ^ _., /K) _ 

(«o “ O'o ~ ‘^a) i^h ~ %) (‘‘i ~ '‘a) (“a “ “i) (*a “ ®o)’ 


j{a^a^,a^,n^ 

=, . . .^A«-pI __ _.+_JKJ_ 

(ao - «i) («o - «a) («o - “a) («i - “o) (“i “ «a) («i - “s) 

^_/W_+_ /( “a) .. 

(«a - “o) ('*2 - «i) («a - “a) («3 “ «o) (“a “ «i) («a “ “a) 

/ 7 i general, as may easily he shoivn hy induction, a divided 
difference o/thepth order is a symmetric function of its arguments 
and is in fact the sum of (p + 1) functions of the form 

_ Aj^) _ 

difference-product of Ur with «o, ai, a^i, . . a^^i, . . 

It is evident from tliis statement that when the arguments required 
to form a particular divided difference are arranged in a ditfei’ent order, 
the value of the divided difference remains unchanged, e.g. 

®n-2> • • •> ®o) • * •» ^n)* 


^ivided differences are arranged in a taUe of divided differ-^ 
ices as follows: 


ArgiwierU. 

Entry, 



«0 

^«o) 





y(«o. «i) 



/(«l) 


/K. « 1 . ®a) 



A^v ®a) 

/(«„, aj, a„ oj) 

«a 

A<h) 

• 

/(*!, <* 2 > ®s) 



A<^2' “a) 

/(«!, rt2. ffla. * 4 ) 

«8 

A<h) 


;t® 2 . «a. —' 

• 


A«3. ««) 

®4> ** 5 ) ■ 

«4 

A<^t) 


' A^ ®4> * 5 ) 
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v/lhe following inay serve as an example of a table of divided 
differences: 


X. 

f{x). 

/\ 



0 

132651 

8113 




-^48877 


1.08 



-- 

-^-.8587' 


1 

3 

1674TS4 


- 162 




89 U 


1 

•4,. 

166375 


167 




9570' 


1 

7 

19.5112^ 

■ 

173 


’ 


10444 



9 . 

216000 





In this example the differences of the thiitl oider are constant. We 
shall now see under what circumstances a column of constant divided 
differences is obtained. 

Theorems on Divided Differences. 



I. If a function f(x) is numerically equal to the sum of two 
functions g{x)y h{x),for a set of values of the argument x, then 
any divided difference of ffx) formed from those values is equal to 
the sum of the corresponding divided differences of g{x) and h{x)\^ 
For example, 

fla a \ ~ ~ 

=S'K. <*o) + ^K. «o). 

and similarly for differences of higher order. 

IT. A divided difference of cf{x), where e is a constant factor, 
is e times the corresponding divided difference offfx). 

For example, the divided difference of the first order of efiff) is 




= c/(«i.«o)- 


III. The divided differences of order n of are constant 
{where n is a positive integer). c 

Let /(») =a!". 

Then /(oo. "i) “ K” - «i")/(«o “ «j) 

/!) j «»ao""i + ajaa""® + . . . + 
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a homogeneous function of Wo, of degree (n -1). Moreover, 
/t^o. «i. «a) 



•1+ 



»“i + «ift2"-* + . . . + 





ao-a2 



= K”- 



- «2) + - 

«2"‘“)/K-«2)+- • • 






+ “i”'>0-«2)/K- 

^ 2 ) 

11 

1 

2 + 

a2V*'® + - 

. . + a2“‘*) 




+ ai(«o”'® + «2«o”'* + - • . + «2”'®) + - • • 


which is a homogeneous function of of degree {n - 2). 

In general /(^o, a^y . . a^) is a homogeneous function of 

•••!«/> of degree -- 2^)- Taking = ti, we see that 
/K. «!, « 2 > • • ^n) is a constant. 

C Corollary: The dimtled differences of order (w + 1) o/a;" are zero, 
^ ‘l IV. The divided differences of order n of a ^lolynomial of the 
nth degree are constant. 

This theorem follows immediately from theorems I., II., and 
III., since the divided diflerence of order n of each of the terms 
whose degree is less than n is zero. 

V. A divided difference of order r may be expressed as the 
quotient of tivo determinants each of order r + 1. 

Consider the divided diflerence of the third order, 


/l[ao,ai,a2,a3) = : 




'K-«i)(«0-«2)K-“3) 


^ v,As) (difference-product of ' 

"" diflerence-product of a^, a^, a^y a^ 

Now a difference-product may be expressed as a determinant 
of the kind known as Vandermonde^s, thus 

(difference-product of a^, a^y a^ = I a,* a,y^ 


1 


Therefore 

/(®o» ^2> ^a) “ ^ 


n 


a^ 




a ,® 

« 8 ® 

« 0 * 


^ 2 ® 


«0 


«2 

«8 

1 

1 

1 

1 
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w«o) yi[«i) y(«2) Ah) 

"o «1 ®2 «8 


®1> % “s) “ i" 


11 1 1 1 I 

And general for differences of order higher than the third. 
l5^f^Newton*s Formula for Unequal Intervals.—^Let f{u) 
be a function whose divided differences of (say) order 4 vanish 
or are negligiljle; and suppose its values for 4 arguments 
®o> ^3 known so that the table of divided differences 

is as follows; 


Argument. 

Entry. 

Ah) 


' 

A 




Ah «i) 





Ah) 

/(«!. «2) 

Ah h h) 

/(ctQf ^2, 

h) 

«a 

Ah) 

Ah h) 

Ah h h) 



H 

/(«3) 






We may obtain the value of the function for any other 
argument u in the following way. Beginning with the constant 
difference which is of order 3, we have 

f{u, rto. «!, « 2 ) =/(« 0 . « 1 . « 2 . "s)- (1) 

By definition of the divided difference of order 2, 

f{u, a^, a^) =yi(ao- «2) + (“ " «2W“» «o. «i. “ 2 ). 

and therefore 

/(«, ao. “ 1 ) =/(«o. « 1 . “ 2 ) + (« - « 2 jl/(«o> « 1 . « 2 . “s)- (2) 

Again by definition, 

f^Uf Uq) ttj) + (w — ^i)> (^) 

and substituting in this equation the value off{u, a^, a^) from (2), 

«o) + (“ - «i]!A« 0.®1>®2) ^ (« - (« - «2M«0»«a»«2»«»)‘ 

Ako t>y definition f{u) =^ao) + (“ “^®o]lA®> ®o)« (111 

or J{u) »y(ao) + (m - a,) + (« - <* 0 ) (« - 

+ (ft - flj) (» - aj (ft - a^/(<% Oi, Og, <%),' .Ml 
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From the equations (1), (2), (3), (4) the quantitie8yi;M,a^ai,a,)l 
J{u, aj, f[u, are now known and may be inserted 

in the table of divided differences thus: • 


ArguTHient. 

Entry, 






U 

y(“) 

/(«o) 

/(“. «o) 






/(«, «0. 

“l) 

Ah «0. «1. 

h) 

!/ 

Ah) 

yi«o. »i) 

./(«0. Hy 


H 

/(“l. »2) 

“ 2 ) 



y{“a) 

AH> «2. 

" 3 ) 

AH> «1. «2- 

«3) 


AH> «3) 


y(«3) 




«s 







Formula (5) may evidently be generalised to express a 
function whose divided differences of order (n + 1) ai-e negligible 
^ or zero, in the form 

/(») =Ah) + (“ - «ol/’(«o. "i) + (« - «o) ('^ - "i. « 2 ) 

+ (m - (fo) (w - ((,) {u - «2)/'(S. «1. «3) + • • • 

+ (m - «o) (“ - «i) • • • (“ - «i. • • ■. «»)• (6) 

This formula was discovered by Newton.f 

The first term on the right-hand side of this eciiiation represents the 
polynomial of zero degree, which has the value /(u^) at The 

first two terms togetlier represent the polynomial of degree 1, which has 
the values /(aQ) and /(<ij) at and etj respectively, and so on. 

The remainder term which must be added to the right-hand side of 
the equation in order to obtain strict accuracy is in fact 

(a - ao) (w - aj . . . (m• • -i «»i)- 

But this term vanishes if the divided differences of order n are rigorously 
constant. 




Ex.—From the table given below to find the entry corresponding to 3*7G08^ 


X. 



ao=0 

•3989423 


v/* 


- 500 

a-^sa 2*5069 

•3988169 



• 

-1499 

(*2= 5*0164 

*3984408 



• 

-2496 

7*5270 

*3978138 



-199 

-199 


♦ In practice the value of f{u) is usually found by fomiiiig the successive 
divMed differences in this way, as in the worked-out example below. 

t iVtnctjpia (1687), Book iii. Lemma v. Case ii. Cf. Cauchy, (Euvres^ (1) 5, 
p. 409. __ 


(D811) 


2* 
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Forming the successive divided differences of /(w), where uss 3*7608, 
we find 

f(u, (»o, <»i)=/(ao, a^= - 199, 

/(«, tt^= - 600+1-2539 X ( - 199)= - 749-526, 

/(m) = -3989423 + 3-7608 x (- 749-526). 

The calculated value is therefore 0*3986604. 


14. The Qregory-Newton Formula as a Special Case of Newton’s 
Formula. —The Gregory-Newton formula may he regarded as the special 
case of the formula of the last section when the intervals of the argument 
are equal. 

For in Newton’s formula for unequal intervals suppose that we put 
«Q = a, «! = a + ttg = a + 2tfj, . . ., u — a + xw, 
i By constructing a table of divided differences, w'c see that 
/(««» «i)=/K. «*)=+ «’). 

-•- /(«<). “I. = 2 

In the same way we find 

/(ao. “1. «i> "3) = 3 

and so 011 . 

If we now replace tt by a + the fonmila for unequal intervals of 
the argument becomes 

f{a + <m) =/(o) + x^f{a) + ?^~lW/(a) + ?^iZ^^liW/(tt) +. . . ^ 

which is the Gregory-Newton formula. 


. 15. The Practical Application of -Newton’s Formnla.— 

Tn laboratory computation from Newton's formula, we proceed 
by a method which is really identical with that given above 
(Ex. § 13). RearrangiDg the formula of § 13, we see that 


A») =/(«o) + (« - «o) [A%> «i) 

+ (m - ai){/(ao. «!• “ 2 ) + (“ - «2)(/(«o. «i. H "s) + - - •)}]• 
This equation may be written in the form 

/(«)=y!(«o) + (»-«oK (1) 

where =f{a,^ aj + (« - 

• • . • • ' • • 

Vr - rth divided difftoence -f (« - ar)®r+i. 


*'»• 


U« a,.a„), a constant. ^ 

The ®’8 are computed in the following order: «„_i, > 

The value o{f[u) is then obtained froill eqxiation (1). 
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Ex .—To find the function corresponding to the argument 6*417 in 
the following difference table ; 


A rgument. 

Entry. 




G'q = 5 

150 

121 



«i= 7 

392 

265 

24 

1 

«2 = 11 

1452 

457 

32 

1 

«3=13 

2366 

!)17 

46 


a^= 21 

9702 




u = 6'417, 

1 '3=1, ^*2=24 

+ (6-417- 

11)1 = 

19*417, 


- 7)19*417= 109-671)889, 

/(6*417)= 150+ (6*417 - 5)109*679889 
= 305*416402713. 


16. Divided Differences with Repeated Arguments— The 

original definition of divided differences presupposes that the 
arguments concerned are all different. I 1 ‘, however, the 
quantity /{uq, + c) tends to a definite limit as c tends to zero, 

we denote this limit by /{(Iq, Oq), and similarly for divided 
dillereiices of higher order. 

Now suppose that in § 13, =% Since the differences of order 
3 are supposed constant, we see that/(^/o» ^o» ^ 2 ) equal to 

/(^o,^ 2 , ^ 3 ), and the remaining differences ((q, 
may then be calculated just as in the general case when u and 
were supposed different. We may now form another set of 
differences by again taking u = %. Repeating this method, we 
obtain the following table of divided differences: 


Argwnent, Entry. 
«o /(«o) 

«o /(«o) 

«i y{«i) 

.®2 

«8 y(«s) 


y^<*o> ®o) 

/(«0. ®o) 

4«0. “it 
/(®1. ®2) 


reo, flo) 

A%> ®o. «i) 

AHy ®i. «2) 
a^, a^) 


y(«o. "0. ®o. «o) 
/(rt'o. "0. ®o. «i) 

A%> «o. ® 1 * « 2 ) 

/((Tfl, orj, ff,, a,) 
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In terms of these divided differences with repeated arguments 

the formula of h'ewton becomes 

/(u) =/(a„) + (u - “o) + (« - «o- ®o) 

+ (m - «o)®/(®o> ®o. «o) + • • • 

This formula will be used later to obtain an expression for 
the derivatives of a function in terms of its divided differences.* 

Hx—Givm the values ^ to find fix) 

in terms of powers of (x - 3). 

Constructing a table of divided differences and extending it to include 
repeated arguments for x = 3, we obtain 


X. 





42 

68510 

4113 



34 

35606 

2613 

100 

1 

27 

17315 

1026 

69 

1 

11# 

899 

146 

40 

1 

5 

23 

00 

16 

1 

3 

- 13 

2 

8 

1 

3 

- 13 

2 

6 


3 

- 13 





Applying Newton’s formula for repeated arguments, the required 
value is f(x) = — 13 + 2(a5 - 3) + 6 (ic - + (x — 3)®. 

f 17. Lagrange’s Formula of Interpolation.— J^et jfix) be 
the polynomial of degree n which for values a^, a^, a^, 
of the argument x has the values yi[ao)> • • • > A'^n) re¬ 
spectively. By the definition of divided differences, we have 
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Since y(*) is a polynoi^l of degree < its divided differences of 
order (n +1) are zero, i.e. 

<l2> • • • ®n> S') = 0. , 

Arranging the factors of the denominators in the above 
fractions so that the first factor in each denominator is of the 
form (» - a,), we obtain 

_ 

(» - ao)(» - (ti) • • • (» - «n) 

__yw)_ 

{x - -'rtg). . . - «,0 

^__ 

(aj - - Ho) a,,)' 


■/(«») 


^ (a: - «„)(a„ - a^) . . . (a„-a„_i)’ ^ 

which is Lagrange’s formula in a form suitable for computation.* 
Another way of writing this formula is obtained by multi¬ 
plying both sides of equation (A) by 


when we obtain 
A^)=i 


t - rto)(a: - «-^{x - ffg) 

. . . (x-a. 

{x - a^{x - ffg) . . . 

(x-a„) ^ 

K-«i)K-«2) • • 

e 

1 

O 

{x-a^{x-a^ . . . 

(x - a^) ^ 

K-«o)("i-«2) • • • 

{x - <h){x - ai) . . 

. (a:-a„.i) 


A%) 


(«n-ao)K-«i) • • • K-a»-i) 


/(««)• 


(B) 


It is important to note that when a set of experimental data obey a 
law wliich can be expressed algebraically as a polynomial of degree w, 
then not less than (w + 1) obseiwations are reqnii*ed in order to cnnstriict 
the polynomial. If only n values were nsc*d, the resulting polynomial 
would be of degree (n - 1). Before applying the Lagrange formula it is 
therefore necessary to ascertain the order of the divided differences which 
are of constant value and thus find the pmper value for n. 

I Ex, 1. —Given the values 64*o|44^0 SD**! value 

of f(x) corresponding to 27, 

* This formula, generally known as Lagrange’s, was discovered by £. Waring, 
Phil,, Trane. 69 (1779), p. 69. 
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Applying formula (A), we obtain 

__ 

(27 - 14) (27 - 17) (27 - 31) (27 - 36) 


68-7 


OP 


+ 

+ 

/(27)_ 

4160 


(27 - 14) (14 - 17) (14 - 31) (14 - 36) 
64-0 

‘ (27 -17) (17 - 14) (17 - 31) (17 - 36) 
_44-0_ 

(27 - 3l)(31 - 14)'(3r-i7)'(3i - 36) 
39-1 

(27 - 36) (35 - 14) (36 - 17) (35 - 31) 
68-7 64-0 44-0 39-1 


13923 7560 3808 12096’ 

/. /(27) = 49*317 (approx.). 

The required value is 49*3. 

2 ; 0 1 2 5 

Ex, 2. —Given the data ^ ^ to form the cubic function of 

Applying formula (B), we have 

„ (g-l)(a!-2)(a-5) !c(x-2)(x-5) x(,x-l)(x-6) 

(0-l)(0-2)(0-6) l(l-2)(l-6) 2(2-l)(2-6) 

g(a!-l)(g!-2) y 
8(6-1) (6-.2) 

=a?+!B®-ai + 2. 

^ 18. An alternative proof of Lagrange’s formula by the 
4 use of determinants is the following: 

Let Pn denote a polynomial of degree n, and put 

P„=A + Ba! + Ca!* + . . . + La?* 

-/(*)• 

Substituting in succession the values o^,for x, we 
obtain , 

7(ao) = A. + Btto + Ca„*+ +Loo^ 

j{a^ = A + Bffli + Ctti* +. . . + Lui", 


J{a^ - A + B«„ + Ga»< + . Ley*. 
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Eliminating A, B, C, . . . from these equations determinant- 
ally we have 


Pn 

y(®o) 

^«i) 

.A«2) . • 

• /(flSn) 

1 

1 * 

1 

1 

. 1 

X 

«0 


«2 • • 

• an 


< 


. . 

. a,,* 

cc" 






Expanding this determinant according to the elements of the 
first row, we see that 


P» 

1 

1 

. . 

. 1 

-/(«o) 

1 

1 . . 

1 





a© 

«i 

. . 

. On 


a; 

fti . . 

. ftn 








. 


05^ 

ft/ . . 

a/ 





«o" 



. ft,," 


aj" 

ft/ . . 

. ftn" 





1 

1 

1 . 

, . 1 

+ . 

. .+(- 


1 

1 . . . 

1 



X 

tt© 

«2 • 

. . «n 




a; 

ft© . , . 

ftn —1 



7? 

V 

ft./ . 

. . ft/ 





ft©® . . . 

ft®n-i 



a-n 

«o" 

ft/ . 

. . «n” 




a;" 

«©" . . . 

ft"»_l: 


The determinants in this equation may be represented 
as diiference-products. The coefficient of f(a^ is the differ¬ 
ence-product of X, dj, . . the coefficient of J\a^ is the 

difference-product of x, and so on. We may 

write 


1 

1 . . 

1 

= - 

1 

1 

1 . . 

. 1 

= + 

1 

1 

1 . 

. 1 

ao 

«i .. 

ftn 


ftl 

ao 

ft© • • 

ftn 


fta 

ao 

ftl . 

. ftn 

• 

• • 

• 


• 

• 

• • 

• 


• 

• 


. 

ft©" 

a/ . . 

. an" 


ftl" 

ft©" 

fta". . 

. ft-n" 


ft©" 


ft,". , 

. . ftn" . 


i.e, the coefficient of P» is equal to the difference-product of 
«o» • • •, : it is als^ equal to minus the difference- 

product of «!, ^ 0 , ^ 2 ,. . dny or to plus thc difference-product 
of a^y . . a„, and so on. If we now divide through- 

ouf by the coefficient of P„ in equation (1), we obtain the 
result; 
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{x-<ii){x-a^){x-a3) 

K-«i)K-« 2)K-«8) 


(«-«n) 

(ao-®n) 


\f{a) (‘g-<^o)(‘”-^2)(«-«8) • • • {i«-an) 
^^ai-ao)K-«2)(®i-«s) • • • K-®») 


„ {x-a^){x-a^) ■ . ■ {x-an,i) 

• +/V . . . (a„-a„.i)’ 

which is the formula required. 

There is an infinite number of functions of x, each of which has the 
values /(^q), /(a^), . . /(a^) at a^, a^, . . respectively. 'In the 

practical applications of mathematics, however, we consider only functions, 
such that if o,n sufficiently close together, any one of the 

functions may be represented with tolerable accuracy by the polynomial 
P„, for the range of values included between a^, a^, . . a^. The 

formula may thus be used, for interpolation. 

19. The Remainder Term in Lagrange’s Formula 
of Interpolation.* —Let J{x) be a function of the real 
variable x defined in an interval to which belong the values 
x^ Xi, . . ., Xa, and possessing in this interv6d the derivative 
of order n. 

Consider the function ff(x), where 


^(®) = 

A^) 

a?” 

. . 

. X 

1 



n 

»o"'^ . . 


1 


./(«») 

/>• n 

. . 


1 


The determinant vanishes for the values % x^,. . ., x„. By 
the differential calculus we see that since ff(x) vanishes for 
(ft + 1) values of x, its derivative ff'(x) vanishes for n values of 
X, the second derivative for (n - 1) values, and so on; the nth 
derivative vanishing for one value of a; in the interval. Thus- 
there ensts a value x intermediate between XgiX^, . . x„ such 
that (/"^x) = 0. 1 

Forming the nth derivative of the determinant by differ, 
entiating the variable elemento of the first row, we have: ^ 

* Feano, Seritti offerti ad E, iff Ovidio (Torin, 1918), p. 388. 
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/“(») n! 0 


0 


0 

1 


= 0 . 


W®«) ®«"-^ -.. ®« 1 


If we expand this determinant according to the elements of 
the first column and solve for in the resulting equation, 
we find 

■ftx 1 = (^0 ~ ^a) (^0 ~ ^a) • • • ~ j^n ) 

(®i-«2)(®i -®8) • • • (a-’i-®!.)' 

^(®2-^'i)K-« 3) • • • {i«2-»»r * 




( a;o-a;i)( a ;o-a! 2) ■ . . K^n-i) v 
-»2) • • • 

^ (^0 ~ (^0 - ^^ 2 ) • • ♦ (^0 ~ 

where x is some number intermediate between Xo,Xi,.. x„ 
This is Lagrange's formula with a remainder term. 


Examples on Chapter II 
1 

1 . If /(a) = find the divided differences/(a, 6),/(a, 6, c), and/(a, 6, c, d). 

2. If /(ic) = ^(aj)4-^(a;), where fjf(x) = a;* and A(jc) = u:;®, verify that 
f(5, 7, 11, 13)=:(/(5, 7, 11, 13) + /K5, 7, 11, 13). 

3. Given the values 

a; 4 5 7 10 11 13 

jif) 48 100 294 900 1210 2028 

form the table of divided differences and extend it to include the values 
of the function for a; = 3 and a:= 14. 

Find the function /(a;) in each of the following cases : 

, (a) X 11 13 14 18 19 21 

/(aj) 1342 22d0 2758 6860 6878 9282 

(6) a; 16 17 10 23 29 31 

66536 83621 130321 279841 707281 923621 

by means of a table of divided differences. 
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Calculate /(I), given the values 

a; 0 2 3 6 7 9 

/(«) 668603 704969 729000 804367 830684 884736 

Assuming/(a?) to be a function of the fourth degree in a;, find the' 
val^ofy(19) from the values 

X 11 17 21 23 31 

/(a;) 14646 83526 194486 279846 923626 

^^7. The values of a cubic function are 150, 392, 1452, 2366, and 
6202, corresponding to the values of the argument 6, 7, 11, 13, 17 
respectively. Apply the Lagrange formiila to find the function when 
the argument has the values 9 and 6*5 respectively. 

8. Find an expression for the function in each of the examples (6) 
and (7), using the Lagrange formula of interpolation. 

9. If <P(«) ==/(«)l/(aj), 

obtain the formula (analogous to Leibnitz’s formula for the nth derivative 
of a product) 

ft 

• • *» ^o) = ^ /(^n> • • •> • • • ^o)* 

A :»0 

(Steffensen.) 
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CENTRAL-DIFFERENCE FORMULAE 

20. Central-Difference Notations. —In this chapter we 
shall consider certain formulae of interpolation which employ 
differences taken nearly or exactly from a single horizontal line 
of the difference table. In order to express these simply it is 
convenient to modify the notation of the calculus of differences. 

Several systems of modified notation are in use. One, 
which we shall frequently employ, was introduced by W. F. 
Sheppard* and will be understood from the following difference 
table. It is based on a symbol 8 which may be regarded as 
equivalent to AE “ where E as usual denotes the transition 
from any number to the number next below it in the difference 
table, i,e, E = 1 + A. . 

Since 5=AE“^ and therefore A=8E^, we may write 
A^w^, = 5*wp A®i4o = S%^,. . A'% = 5"«^and so on. 

Rewriting the ordinary difference table, we obtain 


Argument, 

Entry, 





a-2w 








8u - 










a-w 

^-1 









8»«.j 


a 

^0 




8*«o 







a + w 





8*Wi 







a + 2w 

^2 


8*M2 


8^2 


If we suppose each row of^the difference fable to be numbered with 
the suffix p of the corresponding entry Up^ or, in the case of a row 
situated midway between two entries Up and Wp+i, to take the number 
we see that A®*’w#u the differenoes of even order of Wq, are repre¬ 
sented in the central-di^ence notation by since they are situated 
* Proe, London Math, Soe, 31 (1899), p. 459. 
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in the row r. The differences of odd order aie represented by 

the expression since they lie in the row r + 

It is often required to find the arithmetic mean of two 
adjacent entries in the same column of differences. In the 
8 system of notation we indicate this mean by the symbol /*. 
Thus ijl8Uq is defined to be ^ + 8u^), /x8®Wq is ^ 
and so on for the mean differences of the other entries. The 
mean differences may be inserted in the table to fill in the gaps 
that occur between the symbols of the quantities from which 
they are derived. 

In another notation which was suggested by S. A Joffe* the 
symbol ^ is used instead of 8. The notation is illustrated in the 
following difference table: 

Argument, Mvdry, 



«_2 

A«-i 



a-w 


A*“_i 

A*»-j 

A«-i 

a 

«0 

A®«o 

AS> 



A«j 

A®wi 

A®»j 

a + w 

«1 

A«j 



a + 2ti7 

«2 




21 . The 

Newton-Gauss 

Formula 

of Interpolation. 


Suppose that a function/{a+ is given for the values 
. . . a - a, a + 24?, a + 2w, . . . 
of its argument. 

If in the Newton formula for unequal intervals we take 

a^ = a-\-2w, ^4 « a - and so on, 

and denote a + xwhy u, we obtain 

a + w) + - a) (w - a - w)f{<i, a + w^a - w) 

+ (w - a) - a - - a + w)f[a, 2w) 

+ (w - a) - i/;) ( 1 ^ - a + w) (i 4 - a - 2w) 

f(a, cr + w, a^w, a + 2w,a^ 2w) 

+ - a) - a -(w - a +- a - 2w) (« - a; + 2w) 

f{a, a-^w, a-w, a + 2w, a--2w, 3w), 

(1) 


Hh • • • 


* Trana, Act. Soe. Amer. 18 (1917), p. 91. ^ 
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The divided dififerences contained in this equation may be 
written in the ordinary notation of differences as follows; 

f(a, a + ‘w,a-w) = - w), 

f{a, a + io,a-w,a + 2w)^ ~ 

■ etc. 


Equation (1) thus takes the form 
/(a + iTiw) =/(«) + x^f{a) + - A’y(a - w) 

^ (x + 2) (x- + iKa! -1 ) 4 .. . . (A) 


This formula, which is one of the group of formulae known to 
Newton, is often called the Ggms fo rmula. 

The diffevenees used in this formula are as nearly as possible in the 
horizontal line through /(a) in the original diffemice table. Tlie formula 
is therefore convenient for use when the value of the aigument for whic* 
the function is required is near the middle of the tabulated values. Ihis 
formula may be represented more simply by using the symbol (n)r to 
denote the binomial coefficient 

n(?i-l)(?i-2) . . . (7i~r+l) 
rl ’ 


so that it may be written 

■' +(*+l)4A«/(a-2w) + (*+2)5AS/(o-Sfa>) + . . - (B) 


22. The Newton-Qauss Backward Fomrala.— From fche 
formula of the last seotiqp another may he derived which 
is often used when » is measured in a negative direction 
from /(a), ie. towards decreasing values of the argument. 
Sifppose we write /(a - xw) in the form/{<t + *( - to)} and change 
the sign of to in the discussion of the last section. The 
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order of the arguments and corresponding entries is then 
reversed. Instead of Aj\a) in the Newton-Gauss formula we 
now have/(a - w)-/(a), or -£i/{a-w); £i^J{a-w) in the 
above formula becomes - Ayi(a - 2w ); Ay(a - 2w) becomes 
- A^a - Zw), and so on. We thus obtain the formula 

f{a - xw) =/(«) - xLf{a - w) + {x)^b?‘/{a - vj) -{x + l) 3 A^a - 2w) 
+ (x + l) 4 A*yi(a - 2w) - (a: + 2)^^^f{a - 3w) +. . . 


which has been called the Newton-Gaitss formula for negative 
interfdaiion, or the Newton-Oauss haekward formula. 

23. The Newton-Stirling Formula. —In the Gauss formula 

f{a + xw) =f{a) + ®Ay(a) + \x{x - 1) A^« - w) 

+ J(a: + l)a;(a; - l)A^a - w) 

+ VI +1)®(® -1) (« - 2) AV(« - 2to) +. . . 


the terms may be rearranged thus: 

f{a + xw) =f(a) + x{^f{a) - \b?f{a - w)} + j,fi?f{a - w) 

+ gy—- WiP' - 2w)} 


+ . . . 


Suppose we replace the differences of even order within the 
brackets by differences of odd order, using the identities 

A*yi(a -w) = Ay{a) - ^f{a - w), 

A^a - 210 ) = A^a -w)- ^f{<i - 2w), 

and so on. We obtain the result 


+ a m. - «■) H- - 2») ^ . 

6. It ^ 4 . 

+ (a^ - 2^) A‘y(ffl - 2w) + Ay(tt - 3w) 


(A) 
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This formula, which was first given by Newton,* was after¬ 
wards studied by Stirling f and is called the Newton-Stirling 
formula. 


Tlie mean-differences A/(a) + A/(a - A®/(a - 2w) + A®/(a - w)}, 

etc., are completely symmetrical with regard to increasing and decreasing 
arguments. This fact enables us to expreas the formula very concisely 
by means of the central-difference notation of § 20 ; 




- « 0 ++ 


31 ' 




where fi8uQ= + Smj), 

/i8*Uo=|(8®it..j + 5*«5), 

and so on. 


(B) 


24. The Newton-Bessel Formula.—In the Newton-Gauss 
formula 

J{a + xw) =/{a) + xA/{a) + lx(x - 1 ) A^« - w) 
l(x + l)x{x - l)A®/1(rt - If) 

+ -^{x-{-\)x(x~\)(x-2)Ay\a-2w)-^. . 
let us substitute for iA^a - w), 1 A*/l(a - 2w), etc., their 

values obtained from the identities 

f{a)=f{a + 'i(^-Af{a). 

A^^rt - w) = A^JXa) - A?J{a - ir), 

Ay(a - 2w) = Ay(a - w) - Ay(r6 - 2io), 
etc. 

The above equation becomes 

f{<i + xw) = \ {/(a) +yi(a -1- w)} + (® - y)AJ{a) 

+ - «’) + AM®)} + ^^AsyKa - w) 

+ + - 2 «-) + AM® - w)} +. . . (A) 

which is symmetrical with respect to the argument (a + Jw). 

This formula, which wag first given by Newton % and later 
used by Bessel, is called the Newton-Bessel formvla. 

* Newton, Methodus Differentialis (1711), Prop. iii. Case i. 

4 Stirling, Methodus Differentialis (1730), Prop. xx. 

{ Methodus Differentialis (1711), Prop. iii. Case ii. ; Stirling, Methodus 
Differeniialis (1730), Prop. xx. Case ii. 
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If in this formula we write a; - J = it becomes 

/(a + iw + +/[« + «')} +yA/(«) 

+ y^l{L^f{a - w) + A?/(a)} + - w) 

+ - 2w) + A«/(a -«>)} + . . . (B) 

» 25. The Laplace-Everett Formula.—When it is required 
to interpolate between /(a) and /(a + w) in the construction 
of tables by the subdivision of intervals, statisticians often use 
a formula due to Laplace and Everett,* which may be written 

where denotes f{a-\-xw), and ^ denotes (1 - a?), and where as 
usual 8* denotes A^E“^ Thus for a? = ^ = 1 -1 = J. 

This formula involves only even central differences of each 
of the two middle terms of the series between which the 
interpolation has to be made. 

To prove this formula we eliminate from the Newton-Gauss 
formula 

f{a + xw) = /(a) + a;A/(a) + {x) 2 ^^J\a -w) + (x + - w) 

+ (a;+l)4Ay(a-2«;) + (a? + 2)5Ay(a-2w;)+. . . 

the differences of odd order by means of the relations 

A/l^) =/(^ + ^0 ‘-/(^)» = Ay(a) - Ay(a - w), 

. Ay(a - 2w) = Ay(a - w?) - Ay(a - 2%o) . . . 

The Newton-Gauss formula becomes 

w/ y y 

/(a + xw) =J{a) + x{f{a + w) -/(a)} + {x)J^^f{a - w) 

+ (* + l)8{AVl(a) - A®/(a - w)} + (a; + - 2w) 

+ (a! + 2)5{AV(a-w)-A*/(a-2w)} + . . . 

Using the relation (p + l)«+i = (p) 4 +i + (p)?. this equation 
may be written 

* Laplace, Thiorie anaL dea Prob,, p, 15; Everett, BrU, Aaaoe, Rep, (IfilOO), 
p. 648; JJ.A. 85, p. 452 (1901). Tables of the co-efficients have been published 
,in Trwta for Computera, No. V. 
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/(a+aw) = (1 - x)f(a) + x/(a + w) + (® + l)8A*yi;a) - {x) 3 ^y(a - w) 
+ (a; + 2)5AV(a-w)-(a! + l)5A*y(a-2w) + . . . 
Introducing central differences and rearranging the terms, 

/l(a + a;w) = (l-a:]l/(a)-(®)3SVi(a)-(a: + l)68*yi(a)-. . . 

+ xf(fl + w) + {x+ + «>) + (» + + «?)+. . . 

If we now transform the coefficients of f{a) by means of the 
relation 1 - » = so that {x)^ = - (^ + 1 ) 3 , (x + 1)5 = - (5 + 2 ) 5 . 
etc., we have 

f{a + xw) = £/(«) + (^ + l) 3 ^f{a) + (^ + 2 ) 5 S«y(a) +. . . 

+ aj/i(a +w) + (»+l)38*yi[« + w) + (a: +2)5S^a +w)+ . . 

which is Everett's formula for equal * intervals of the argument, 

26. Example of Central-Difference Formulae.—The fol¬ 
lowing example illustrates the various central-difference formulae:f 

To compute the value of log^Q cosh 0*3655, having given a table of values 
of Zo^iQ cosh X at intervals 0*002 of the argument. 

Forming the difTerence table, we see that the diflcrences of the third 
order are approximately constant. The difFerences of the fourth order 
will, however, be taken into account since such a difference may affect 
the accuracy of the last figure of the result. 


Argument. 

JSntry. 



0-360 

0 0276 5462 3980 




30061 3825 



0-362 

278 5623 7806 

152 8035 



30214 1860 


-2122 

0-364 

281 6737 9666 

152 5913 

-13 


30366 7773 


-2135 

1-4. 0-366 

284 6104 7438 

152 3778 

-3 


30519 1651 


-2138 

0-368 

287 6623 8989 

152 1640 



30671 3191 



0-370 

290 7295 2180 



In EveretVs formula put = ^ = J, and 

0-0281 5737 9666. 


/(0-3655) = J(281 6737 9665) + (- 52 5913) + -^^4 - 13) 

+1(284 6104 7438) + (- ^)(162 3778) + (- 3) 

= 283 8613 0494-75 - 14 2937-69 - 0-13 = 283 8498 7557-03. 
.*. log cosh (0-3666)^ 0-0283 8498 7657 . 

* Corresponding formulae for unequal intervals have been given by R. Tod- 
hunter, J,I.A, 50 (1916), p. 137, and by G. J. Lidstone, Proc, Edin, Math, 80 c, 
40^1922), p. 26. 

t A valuable set of worked-out examples is given by L. J. Comrie in the 
Nautical Almanac for 1937, pp. 931-934. 
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In the Ne/wton-Bmel fornnila put 

A0’3e66)=« 6104 7438j'*'i^®®®®® 7773) 

/ 152 5913\ 

+ (“¥ 2 )!^+ 152 3778^ ~rl8(~2136) + ^j7i(- 13 -3) 

= 28309213551-5 + 75916943-26 - 142954-27 + 16-68 - 0-14 
= 283 8498 7667-02. 
log cosh (0-3666) = 0-0283 8498 7567. 

By the Newton-Gauss formula 

/(0-3665) = 281 5737 9666 + f (3 0366 7773) + ( - JV) (152 5913) 

■*'(~Tl«)(“2136)-fTj^^lf(— 13) 
= 281 6737 9666 + 2277 50829-76 - 14 3064-34 

-H16-76-0-22 

= 283 8498 7566-95. 
log cosh (0-3665) = 0-0283 8498 7667. 

By the Newton-Stirling formula 

/(0-3666) = 281 5737 9665 + f -’^^3 ^®®g^ + /^,(152 5913) 

+ (~ T¥ir)-i^ _ 2135 ) ^ ff) (- 13) 

= 281 5737 9666 + 2 2717 8612-38 + 42 9163-03 

-1-116-40 +-13 

= 283 8498 7656-94. 
log cosh (0-3665) = 0-0283 8498 7557. 

27 . The Formulae of the preceding Sections may be 
expressed more concisely by means of the Central- 
Difference Notation of § 20. 

Everett's formula: 

= + , . . + (f + ^)2r+l5^'’^0 + • • • 

+ xUi-^{x^l)^8Hj^ + {x + 2)^S^Ui-h. . . + (a? + r) 2 r+iS^'’«^i +. . . 
The Newton-Bessel formula: 

«»+ (® - -1- (a;)2/a8*Wj + 

+ {X + 1 )^^ + + . . . 

+. . . + (® + r - l)arAi^«j + (a? + r - +. . . 

The Newton-Gams formula: 

tt* = Mo + ®8» j + (®)*8 ®Mo + (« + 1)88®m^ + (« + 1)48*Mo -h (® + 2 ) 58 ®m^ 
+ . . . + (» + r - l)2r8*’Mo + (® + + • " • 
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The Newton-Stii'ling formula: 

% ° + g/tawo + 8S> 

+ J{(a5 + r)g, + (a! + r - l)2r}6®''iiio + (® + »’)2<+ • • • 
Newton-Gams backward formula: 

«_* = %- xSu _ j + (a;) 2 S*tf„ - (a: + l)35®w _ J 
+ (a: + l)4S«Mo-(a: + 2)68®w.j + . . . 

+ (a: + r-l)jrS^Mo-(a: + r)2,+i8*^+^_j + . . . 

28. The Lozenge Diagram. —Wc slrall now give a method 
which enables us to find a large number of formulae of 
interpolation. 

Let {p)q denote the quantity 
entry/(a + o'w). We obtain at once the relations 

(P)t = (?' + l)?+i - (V)g+i> (1) 

A«w _ ,+i - A%., = A^+hi.n (2) 

and, combining these equations, we see that 

{p),{AlU.r+i - A«W.r} = {(/> + l)g+l - {l>),+i}A'>+hl 

or 

(p)g^9u.r + (?? + = {p)qA^U^r+i + (2?)^+iA^+%_r- (3) 

Suppose we arrange these terms in the form of a “ lozenge ” so 
that the terms on the left-hand side of the equation lie along 
the two upper sides of the lozenge and the terms of the right- 



Fm. 2, 
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hand side along the lower sides. We obtain the above diagram 
in which a line directed from left to right joining two quantities 
denotes the addition of those quantities. 

Equation (3) may be expressed by the statement that: in 
travelling from the left-hand vertex to the right-hand vertex of 
the lozenge in the diagram^ the mm of the elements which lie 
along the upper route is equal to the sum of the elements which lie 
along the lovm route. 

It is evident that this statement may be extended. For 
example, let us place in contiguity the lozenges correspond¬ 
ing to 


"p^n 


'p = n-l 

?=i 


"p = n 
? = 2 


\r=1 y v?"=0 y Vr=ly 

so that the upper vertices of the lozenges, which are of the form 
{'p)qis!^u^.ri form a sort of difference table: 

We obtain the following diagram: 



Fra. S. 


Applying the rule given by equation (3), it is evident that 
the sum of the elements along eith/ur of the following routes is 
the same; 

«o + + (m + 1)2A®«_i + (» + 1)8A*M -1, 

tto+(«)iA«o +(w)2A%_i + (Jl + 1)3 A%-i, 

®o+(™)iA«o +W 2 AX +(n)8A»M-i. 
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Since += +we may form three 

other expressions beginning with the term instead of and 
equivalent to those already given, namely, 

. + (n - l)iAi^o + (7i)2A%_i + {n + 

and two similar expressions. 

If we examine the structure of this diagram, it will be seen 
that the values of q and r in the expression {p)qA^u.r are 
arranged in precisely the same way as for the differences 
iSsf{u - rw) in an ordinary difference table. The values of p are 
constant along any diagonal descending from left to right of 
tlic diagram, while along a diagonal ascending from left to right 
these values increase by unity at each vertex. The first value 
of p along either line radiating from Uq is taken tohe p = n. 

By extending this diagram we arrive at the following, which 



Pig. 4. 
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be called a lozenge or **Fraser** diagram since it is a 
modification of one due to D. C. Fraser.* 

Now the Gregory-Newton formula for Un is the sum of the 
elements from along the downward sloping line to the line 
of zero differences. So % = the sum of the elements from Uq 
along any route whatever to the line of zero differences. 

From the identity Wo += +^ evident 

that the value of is unaltered if a route is selected starting 
from % instead of from % general the sum of the dements 
along any route proceeding from any entry u^ whatever to the 
line of zero differences is equal to u^. 

Applying this rule, we have at once from the lozenge diagram 
^n— “O + + (^ + 1 ) 2 A 2 w _2 + {n + 2)3A3w_3 

+ (71+3)4A4m_4 + . . . (4) 

1 A^=Wo+(w<)iAw_ 1 +(w+ l)2A2?4_i+(n+ 1 ) 3 A®W _2 

+ {n + 2\A^tc_^ + . . . ( 5 ) 

Wn=Wo +WiAwo + (n) 2 A%_i + (w+l) 3 A®M_i + (n+ 1 ) 4 A^ia _2 + . . . ( 6 ) 

Vtfi =+ (W' — 1)jA'Miq + (ii) 2 A®Wq + (W') 3 A®'a_+ (ti + l)4A^'i4_^ +, . , (7) 

Eewriting equations (5), ( 6 ) in the central-difference notation, we find 

Wn=Wo+Wl^'**-i + (^+l)2^%+(^+^)3^®^-J + (^+2)484w3 + . . . 

and 

= Uq + (n)iSMj + {n) 2 ShiQ + (n l)35®wj -I- (71 + 1 ) 48 *m .3 -t-. . . 

which is the Newton-Gauss formula. 

If we now take the mean of these values of we obtain the formula 
whose differences are along the row corresponding to Uq : 

“ «o+j+Stti) + i {(« + 1 ) 2 +Wj} ^*0 

+ (»+ 1 ) 3 + 8 ®«j) + + 2 )^ +(» + 1 )^} 8 *Uq +. . . 

m 

«.=«o + (”)i/* 8 mo + - l)8^o +. . . 

which is the Newton-Stirling formula. 

The mean value of from equations ( 6 ), (7) may be expressed either 
as Everett’s formula or as the Newton-Bessel formula. Writing ( 6 ), (7) 
in the central-difference notation, 

+ (w + r — l) 2 y 8 ®^Wo + (n-fr) 2 r 4 .i 8 ®»‘'^%j-l-. . . ( 8 ) 

tin «Wi + (w--l)i 8 Mj-i-( 11 ) 28 *^ 1 -I-(ri)38®74-!-(w-fp;484ttj+.. ... 

+ (n + r - l) 2 f 8 ®»‘«i + (n + r - l) 2 r+i 8 ®^‘*‘^wj +. . . (9) 

Taking the arithmetic mean of these values of %, we may eliminate 
♦ J,IA. 48 (1909), p. 238. 
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differences of odd order by applying the relations (p), = (/> + l)g+i ~ (p)g+i 
and = - 8 *'*% The coefficient of lakes the fonn 

^{(n + r- l)2r + (^ + r)2r+i + (w + r-or (w + Ogr+i- The co¬ 
efficient of becomes ^{(n + r - l)2r -{n + r)2r+i -(n + r- l)2r+i} 

or - (n -H r - l)2r+n and by Bubetituting ^ for (1 - n) we see that 

— (w- + r - l)2r+i =® — (r — ^)2r+i = (^ + ^)2r+l* 

The arithmetic mean of equations (8), (9) may thus be written in the form 

= +(^+1)8S^Wo4-4-2)5S%o + . • • + (^ + r)2/*+i^^*‘«o + « • • 

+ utAj + (^ 4“ + (n» + 2)g8^Wj^ 4* • • • 4* 4" ^’)2»'+4". . . 

which is Everett’s formula. 

Suppose, how'ever, we find the arithmetic mean of the values of 
in (8) and (9) and simplify the coefficients of differences of odd order in 
the i-esulting expression by means of the relation 

ft ^ 

i{(« + ’■)2/-+i +(» + »•- l)2r+i} =(n + r- IV • J• 

We now obtain the result 

Un = fJtu^ + (n- J)8 j 4- {v )2/a82 ^ ^ J-4- (h 4* 1 )4/a 8* itj 4-. .. 

^ X 

+ (ji + r-lV/iS*»'aj+(»i + r-+ . • ■ 

which is the Newton-Bessel formula. 

29. Relative Accuracy of Oeutral-Difference Formulae.-- 

It is frequently necessary to use approximate formulae which 
terminate before the column of zero differences is reached. From 
the last section we have seen that the sums of the elements along 
any two routes which terminate at the same vertex are identical. 
If the routes terminate at two adjacent vertices {p),jA^u.r+i and 
{p)qJS.^u^r which are in the same "lozenge,” the sums of the 
elements along these routes differ by i,e. 

Extending this result to routes terminating in 
th\j same column of differences, for example, at A*w _3 and A^^, 
it is evident that the sums of the elements along these routes 
differ by {n 4- 2 ) 4 A®w -3 4- (n 4- l) 4 A®w «2 + (^) 4 A®u.i. 

We shall now consider routes that lie along horizontal 
lines; these yield the formulae containing mean-differences. In 
the last section it was shown that a mean-difference formula is 
obtained by taking the ariVimetic mean of the elements along 
two adjacent routes. From the mode of formation we see that 
the sums of the elements along such routes are identical as far 
as the vertices at the intersections of the routes. For example, 
the Newton-Gauss formula is equivalent to the Newton-StirUng 
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formula as far as differences of even order, and it is also 
equivalent to the Newton-Bessel formula as far as differences 
of odd order. When a formula is curtailed, the question arises 
as to whether it is more advantageous to select a route which 
terminates at a mean difference or at an ordinary difference. 

The following diagram represents the portion of the lozenge 
diagram along the row corresponding to and adjacent to 
the differences of order 2r. Let A denote the mean differ¬ 
ence {n + ^ denote the mean difference 

(rn- r - 



The route along the dotted line through A represents the 
Newton-Stirling formula and the route along the dotted line 
through B represents the Newton-Bessel formula. The Newton- 
Gauss formula, which is represented in the diagram by a zigzag 
intermediate route, is equivalent to the Stirling formula at the 
vertices and it is also equivalent to the Bessel 

formula at the vertices and 

Consider the three routes representing the Gauss and the 
Stirling formulae and the formula which contains the differences 

and If we suppose these 

formulae to be curtailed so that the last difference of each is of 
order 2r +1, we may compare the accuracy of these formulae 
by ascertaining the magnitude of the neglected terms of order 
(2r + 2). The sum of the elements along either of the routes fiom 
the common vertex to the line of zero differences being 
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the same, the most accurate formula is the one in which the 
neglected term of order {2r + 2 ) is the smallest. These terms are; 
{n + I {{n + r) 2 H -2 +{n + r + 

{71 +r+ l)2r-l-2^'*+X 

respectively, and they are also arranged in ascending order of 
magnitude. The Newton-Gauss formula is therefore more 
accurate as far as mean differences of order (2r+l), when 
further terms are neglected, than the corresponding Newton- 
Stirliiig formula passing through the same differences of even 
order; and both are more accurate than the formula containing 
the difference In precisely the same way we see 

that the Bessel formula is more accurate than the Gauss 
formula as far as differences of even order when further terms 
are neglected. In general, a ce7itraUdiffere7ice formula terminat¬ 
ing at a 7nean difference of the entmj Up is rnore accurate than 
a foi'mula which is curtailed at the corresponding central- 
difference of Upa7id it is less accu7'ate than a for7nula udiich 
is cuodailed at the corresponding difference of 

We shall now illustrate by an example the superiority which central- 
difference formulae generally have over other interpolation formulae. 

Let it be required to find where — If we employ for 

this purpose an interpolation formula which proceeds according to 
central differences of Uq, and stop at the (2r + l)th term, the result is 
the same as if we employed Lagrange’s formula with given values of 
u_rf U_r+i9 • • Wrj so that by § 19 the error is 

(a: + r)(3! + j--l) . . • (^ - r )f,2r+iue\ 

(2r+])r ■' 

where ^ denotes some number between a — rw and a + rw. If, on the 
other hand, we employ the Gregory-Newton formula, and stop at the 
( 2 r + l)th term, the result we thereby obtain is the same as if we 
employed Lagrange’s formula with given values of Wq, Wj, . . ., sq 
that the error is 

x(x-l) . . . (a:-2r) .,a,+i), , 

(2r+l)! ■' 

where denotes some number l)etween a and a + 2rw. Now 
does not, in most cases, differ greaHy from but (a; + r) (a; + r — 1) 

. . . {x — r) is much smaller than flp(a5 — 1) . . . (sc — 2r) in absolute 
value when — ^<a;< J. Thus the error is smaller in the former case than 

* if detailed discussion of the accuracy of interpolation formulae is given ii» 
papers by W. F. Sheppard, Proc. Lond. Math. Soe. 4 (1906), p. 320, and 10 (1911), 
p. 139; D. 0. Fraser, JJ.A. 50, pp. 25-27: G. J. Lidstone, Trans. Fac. Act. 9 (1928). 
(d 811) a 
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in the latter. For this reason central-difference formulae are prefer¬ 
able to the ordinary formulae for advancing differences. 

The following remarks* are of general application : 

“ Formulas which proceed to constant differences are exact, and are 
true for all values of n whether integral or fractional. 

“ Formulas which stop short of constant differences are approximations. 

“ Approximate formulas which terminate with the same difference are 
identically equal. 

** Approximate formulas which terminate with distinct differences of 
the same order are not identical. The difference between them is 
expressed by the chain of lines necessary to complete the circuit.” 


30. Preliminary Transformations. — In certain cases 
formulae of interpolation should not be used until some 
preliminary transformation has been effected. We shall illus¬ 
trate this by two examples. 

Ex, 1.—Suppose that it is required to find L sin IS''. We have from 


a table of logarithms the following entries : 



0. 

L sin B. 




o 

1—• 

o 

o 

o 

5*6865749 

3010300 



20" 

6*9866049 

1760912 

- 1249388 

737864 

30" 

6*1626961 

1249388 

-511524 

231236 

40" 

6*2876349 

969100 

- 280288 


50" 

6*3845449 





The differences are evidently very slowly convergent. One reason for 
this will be seen when it is remembered that when 9 is small and 
radians, then sin a; = a;- Jx®-l-. . . and x=^sin 1" (nearly), 
so that L sin 0 = L sin I"-!-log 0 (nearly), and the differences of log 0 
for the values 10, 20, 30, 40, 50 ... of ^ are very slowly convergent. 
We therefore calculate L sin 6 when 9 is small by adding the inter¬ 


polated values of L 



which has regular differences, and log 9^ for 


which tables' exist with smaller intervals of the argument. 


Ex. 2.—Suppose it is required to interpolate between two terms of 
such a sequence as the following: 


1 r r(r+l)(r + 2)(r + 3) 

’ p p'(p+l)’ p(p+l)(p + 2y p(p + l)(p + 2)(p + 3y 

where r and p are two widely different numbers. c 

* D. C. Fraser, J,I,A, 43 (1909), p. 238. 
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It ia best to interpolate in the sequence of niinierators 
1, r, r(r+l), r(r+l)(r + 2), . . . 
and to interpolate separately in the sequence of denominators 

P0^+1)(p + 2) . . . 

We then divide the former result by the latter, in order to obtain the 
required interpolated value. 

Stirling {Methodus Differentialis (1730), Prop. xvii. Scholium) says: 
“ As in common algebra the whole art of the analyst does not consist 
in the resolution of the equations but in bringing the problems thereto ; 
so likewise in this analysis : there is less dexterity required in the 
performance of the process of interpolation than in the preliminary 
determination of the sequences which are best fitted for interpolation.*' 
The general rule is to make such transformations as will make the 
interpolation as simple as possible. 


Examples on Chapter III 

1. Given 

sin 25° 41' 40" ==0*433 571 711 655 565 
sin 25° 42' 0" = 0*433 659 084 587 544 
20" = 0*433 746 453 442 359 
40" = 0*433 833 818 219 189 

find the value of sin 25° 42' 10" by the Newton-Gauss formula. 

2. Find the value of log sin 0° 16' 8"*5 having given 

log sin 0° 16' 7" = 7*670 999 750 0 
8" = 7*671 448 629 9 
9" = 7*671 897 046 4 
10" = 7*672 345 000 2 

using the Newton-Gauss formula. 

Check your result by obtaining log sin 0° 16' 8"*5 from the following 
data: 

log sin 0° 16' 6" = 7*670 550 405 5 
8"= 7*671 448 629 9 
10" = 7*672 345 000 2 
12" = 7*673 239 524 3 

3. Apply the Newton-Stirling formula to compute sin 25° 40' 30*^ 
from the table of values 

sin 25° 40' 0" = 0*433134785866963 
20' = 0-433222179172439 
40" = 0-433309668404869 
sin 26° 41' 0" = 0-433390963663401 
» 20'= 0-433484334647243 

and verify your answer, using the Newton-Bessel formula. 
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4. Given 

log 310 = 2-4913617 
320 = 2-5051500 
330 = 2-5185139 
340 = 2-5314789 
350 = 2*5440680 
360 = 2-5663026 

find the value of log 3375 by the Newton-Bessel formula, verifying the 
result by one or more other central-difference formulae and comparing 
it with the true value. [3-5282738.] 

6. Show that the lozenge-diagram method really derives all the 
interpolation formulae by repeated summation by parts, i,e. by the use 
of the formula 

which is the analogue in the Calculus of Differences of the formula 

fudv = uv-~ fvdu 

in the Integral Calculus. 



CHAPTER IV 


APPLICATIONS OF DIFFERENCE FORMULAE 

31. Subtabulation. —An important application of inter¬ 
polation formulae is to the extension of tables of a function. 
Thus, supposing we already possess a table giving sin a? at 
intervals of 1 ' of x, we might wish to construct a table giving 
sin a? at intervals of 10'' of x. This operation is called sub^ 
tabvlation. Subtabulation might evidently be performed by 
calculating each of the new values by ordinary interpolation, 
but when the new values are required in this wholesale fashion 
it is better to proceed otherwise, forming first the differences of 
the new sequence of values of the function, and then calculating 
the latter from those differences.* 

Let Tq, Tj, Ta, T 3 , ... be a given sequence of entries in a 
table corresponding to intervals w of the argument, and let 
their successive differences be ATq = T^ - Tq, A*Tq = Tg - 21!^ + Tq, 
etc. Suppose it is desired to find the values of the function 
in question at intervals wfm of the argument so that (m - 1 ) 
intermediate values are to be interpolated between every two 
consecutive members of the set Tq, T^, Tg. . . . Denote the 
sequences thus required by . . ., so that = Tq, t^. = T^, 

hm = Ta, tzm = T 3 , etc., and let the successive differences in the 
new sequence be 

A]^^q = Aj*^q = ^2 “ etc., 

where A^ is used instead of A to denote the operation of 
differencing in the new sequence. The differences in the new 
sequence may now be found in terms of tbe differences in the 
old sequence by the use of operators in the following way. 

* Lagrange, (Euvres, 5, p. 663 (^792-^). Of. L. J. Gomrie, Monthly Notices 
R.A,8,, 88 (1928), p. 606. F. Emde, Zeitschr, /. angew. Math,, 14 (1934) 333. 
K. Camp, Trans. Act. Soc. Amer,, 38 (1937) 16. The discovery of subtabulation 
shoiild be credited to H. Briggs (1661-1631), who used it in bis computation of 
log. tables. 
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Denoting the initial value or T 0 by f{a), .we have by the 
Gregory-Newton interpolation formula: 

<i«yi(o + M>/m) = To+(l/m)iATo+(l/m) 2 A®To+(l/m) 3 A®To + . . . 
and the operators A^ and A are thus connected by the relation 
Ai=(l/m)iA + (l/»») 2 A®+(l/OT)jA* + . . . (1) 

Suppose for simplicity that A^Tq is the last non-zero difference 
of the original sequence, so that A^o = 0 > A*T 0 = O, etc. 
Equation (1) gives 

Ai* = {(l/m)iA + (l/m)2A® + (l/»n)3A» + (2) 

If we now substitute the values s = 1,2,3,4 in the last equation, 
we are able to determine all the differences of the new sequence 
in terms of the differences of the old sequence: 


(l-m)(l- 2 >«) ^ 3 j, 


- 24 ^- 




V^o--4^*To. 


2m* 


(4) 

( 6 ) 

( 6 ) 


When the differences are thus calculated, the entries t^, 
may be derived in the usual way by simple addition. The 
values of ^ 2 mi hm, • • • formed in this way should agree 
with the tabulated values T^, T 2 , T 3 , . . . 

Ex.—The logs of the numbers 1500 , 1510 , 1520 , 1530 , 1640 being 
given to nine places of decimals^ to find the logs of the integers between 1600 
and 1610 . 


The difference table of the original values is as follows: 


No. 

log. 

A. 

A* 

A* 

A* 

1600 

176091259 

2885688 




1610 

178976947 

2866641( 

- 19047 

249 


1620 

181843688 

2847843 

- 18798 

246 

-4 

1630 

184691431 

2829290 

- 18663 


{ 

1640 

187620721 
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Here m= 10. 

.•.v= — 4 ) = — 0-0004 in the ninth place, which is negligible, 

V= -^) = 0-2544 = 0-25 

IV- -6 IV which is approximately constant, 

Ai=^2885688 + ^1;^>(- 19047) + ^^ . ^^^ 249 

^(l-10)(l-20)(l-30), 

24.104 ^ 

= 288568-8 
857-115 
7-0065 
0-08265 
= 289433-094 


No. 

logs. 

Ai. 

Ai». 

A,*. 

1500 

176091269-1 

289433-1 



1501 

176380692-2 

289240-4 

-192-74 

0-26 

1502 

176669932-6 

289047-9 

- 192-49 

0-26 

1503 

176968980-6 

288855-6 

- 192-24 

0-26 

1504 

177247836-1 


- 191-99 

✓ 



288663-6 


0-25 

1605 

177536499-7 

288471-9 

- 191-74 

0-26 

1506 

177824971-6 

288280-4 

-191-49 

0-25^ 

1507 

178113252-0 


- 191-24 




288089-2 


0-25 

1508 

178401341-2 

287898-2 

- 190-99 

0-25 

1509 

178689239-4 

287707-4 

- 190-74 


1510 

178976946-8 





The required new table is : 


No. 

lug. 

No. 

log. 

1500 

3-176091259 

1606 

3-177824972 

1601 

3-176380692, 

1607 

3-178113252 

1602 

3-176669933 f' 

1508 

3-178401341 

1603 

3-176958981 

1609 

3-178689239 

1504 

3-177247836 

1610 

3-178976947 

1606 

3-177636500 




and the final value of log 1510 agrees with the original value. 
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32. An Alternative Derivation.—It is frequently convenient when 
dealing with a function whose degree is known to insert values of 
the function, intermediate to those already tabulated, by the following 
method: 

Suppose, for example, that a function /(sc) may be represented by a 
polynomial of the third degree, and that values of the function are 
tabulated at intervals ii; —10 of the argument. Let it be required to insert 
values'at an interval 1. Using the notation of the last section, we 
have (by the Gregory-Newton formula) 

T = ^ 

= C = 'o + 

Tj = <20= + 20Ai«o + 190Aj2«„ + 1140Ai®<o, 

Tg =/jo = <„ + aOA^to + 435 W + 4060Ai\ 

Differencing these equations, we see that 

ATo= 10Ai«n + 46Ai\ + 120Ai»/„, 

ATi= 10Ai«o+ 146Ai\+ 1020A,»<o, 

AT5,= 10Ai/o + 246Ai% + 2920Ai«<o. 

SimDarly A*T(, = 100 Ai*/(, + 900Aj%, 

AaTi= 100Ai\+ 1900Ai®/o, 

A®To= lOOOAja/o. 

The leading term and its differences for the subdivided intervals are 
seen to be . - 

A,»to = -OOlA®To, 

Ai\= -01 A*To - •OO 9 AST 0 , 

Ajtg = • 1 AT, - -046A®T, + •0286AST„* 
from which the values tj, tg, . , , are formed by addition. 

Ex.—Having given a table of values of log x at intervals of the argument 
w=5, to insert between log 6250 and log 6255 the intermediate values of 
the function at intervals w=s 1. 


Entry. 

T0 = lpg 6260 = 3-7968800 

A. 

A* 

3473 

-3 

Ti=log 6265 = 3-7962273 

3470 


T2=log 6260 = 3-7966743 

3468 

-2 

Tg=log 6265 = 3-7969211 




The differences of the second order are approximately constant, so we 
assume log a; to be a polynomial of the second degree. 

T,= /,= 3-7968800, 

Tj = fj=<0 SAj/q + lOAj®/,, 

Tj = <10 = t, -I- lOZ\-«0 + 46Ai»<0. 

AT0= 6Ai<0-HOAi%-= 3473, 

ATi=6Ai<o-H36A,*«o. 

A%=25Ai%=-3. 

* These are precisely the set of equations of § 3t when AT,, the third 
differences of the tabnlated function, are assumed to be constant. 
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From these equations we obtain the values 

-0*12, Ai^o=694-84, 

expressed in units of tlie seventh decimal place. 

Forming the difference table for the subdivided intervals, 


Entry, 

log 62 60 = 37958800-00 

Al. 

694-84 

Ai*. 

log 62 61 = 37959494-84 

694-72 

-0-12 

log 62 62 = 37960189-56 

694-60 

-0-12 

log 62 63 = 37960884-16 

694-48 

-0-12 

log 62 64 = 37961678-64 

% 62 66 = 37902273-00 

694-36 

-0-12 


We may now insert these values of the function in the table of values, thus: 

log 6251== 3-7959495 
log 6252 = 3-7960190, etc. 

We may obtain without dilliculty formulae for sub tabulation based 
on central-difference formulae, or on Everett’s formula. These are 
frequently to be preferred to the subtabulatioii formulae based on the 
Gregory-Newton formula. 

Owing to the rapid accumulation of error in the higher orders of 
differences, care must be taken to include additional places of digits in 
the computations, as in the above examples. 

33. Estimation of Population for Individual Ages when 
Populations are given in Age Groups. —We shall now find 
the values of a statistical quantity, such as the population of 
a given district, for individual years, when the sums of its 
values for quinquennial periods are given.* 

Let . . m. 2 , «.-i, Uq, Mj, « 2 , ... lie the values of the 

quantity for individual years, and let the quinquennial sums 
be . . ., W,, Wq, W.i, . . . , so that 

Wj +«g +M 5 +M 4 +« 8 , 

Wg =% +% +“o +W-1 + W-2. 

W_i = M_3 + «_4 + «_5 + «.6 + «-7. 

It is required to find the value Vq in terms of the W’s. 

* G.' King, J,I.A, 48 , p. 109 (1909). See also 60 , p. 32. 

a« 


(DSll) 
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The Newton-Stirling formula may be written 

«•(«»-1)., 


+ . 


If we denote tt„ + u_„ by and neglect the differences of 
the fourth and higher orders, we may write 

y„ = 2MQ + M®A®M_i. 

Therefore Wo = «o + yi + y2 

= 5mq + 5A%.j, 

and Wi + W_i = y8 + y4 + y6 + 2/6 + y7 

= IOmq + 135A®m_i. 

Eliminating A%.4 from the two last equations, % may be 
expressed in terms of the W’s: 

125^0=27Wo-(W.j + W,), 

or, writing A®W_i for (W_i- 2Wo + Wj), we obtain the result 
125wo = 26Wo-A2W.i, 

or = 0 - 2 W„ - 0008 A 2 W. 1 . ( 1 ) 

Ex.—To find the value of the quantity for the middle year of the second 
quinquennium, when the following are three consecutive quinquennial 
sums-. 36556: 39387: 41921. 

Denote the given quinquennial sums by W_i, Wq, respectively, 
and form a difference table. 


W_i = .3e 556 

2831 

Wo=39 387 -297 

2534 

Wi = 41 921 

The required quantity «q is therefore, by (1), 

«o=0-2 X 39 387 --008 (-297) 

= 7877-4+ 2-4 
= 7879-8, 

so M8=7880. 

The above formula may be extended to include the fourth 
differences of the W’s when we neglect the differences of the u'b 
of the sixth and higher orders.* We have now 

* When the groups are unequal, vo can proceed in a similar way, using 
divided differences. 
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= 2mj + - l)A%_a, 

■Wo = Mo + yi + y 2 

= 5wo + + A%_ 2 , ( 2 ) 

Wj + W.i = IOmq + 135A*w_i + 377 A*m.2, 

Wg + W-a = IOmq + 510A®tt_i + 4627 A*m_2. 

Eliminating «q from the three last equations, we have 


AaW.i = 125 A*m.i + 375A*M.a, 
and AWj-AW _2 = 375A*m.i + 4250AV-2> 

and eliminating Aa^t-i from these two equations we find that 
A4^m_2 = 0-00032A*W_2 

and A*m.i = 0-008A*»W.i - 0-00096A«W.2. 

If we now substitute these values in equation (2), we obtain 
the result 

Uq = 0-2(Wq - 5A®m . 1 - A*m . 2 ), 
or = 0-2Wo - 0-008A*W_i + 0-000896A*W.2. 


This value of «q was also given by G. King.* 

The following demonstration of a more general formula is due to 
Q. J. Lidstonc. 


Let 

J 

T 

II 

3r+I 

^ etc., 

r+l 

and let 

yx= 

(2r+l)*-r-l 

2 ug, 


V 


wliere p is some number independent of x. From these definitions we 
have at once 

Aya, = Wa. 


and 


= _L_. 

a ^2(2r+l) » 2(2r+l) 

In Bessel’s formula, 


2 /l + w 
put 


21 

t 

m = 


2(2r+ ly 


Foini the difference and in the result substitute Wund its 

* J,LA. 43, p. 114. 



60 THE CALCULUS OF OBSERVATIONS 


differences for Ay and its differences. We thus obtain the required 
formula. 

The result is 


ni(m^ — i) A o»Tr wi(m® — 4) (m® — ?).. 
«o=2«>Wo+ "1/ 2 ^ -^2-^A«W_, + . . . 


which, when 2r + 1 == 6, becomes 

I Ma=0-2Wo-0-008A*W_i + 0-000896A‘W_2 + . . . 

as found above. 


34. Inverse Interpolation. —We shall now consider the 
process which is the inverse of direct interpolation, namely, that 
of finding the value of the argument corresponding to a given 
value of the function intermediate between two tabulated values, 
when a difference table of the function is given. This is known 
as inverse interpolation. 

Let f{a + xw) denote a particular value of the function of 
which the differences are tabulated. We now wish to find the 
value of the argument x corresponding to /(« + Qcw ); for this 
purpose it is best, if -\<x<\,to use Stirling’s fonnula ♦ 
f{fl + xw) + x\{Lf{a) + A/(a - w)} + lx^l!s?f{a - w) 

+ \x[x^ - + A3/(a - 2w)] 

+ + • • • ( 1 ) 


Dividing throughout by + A/(a - w)], the coefficient 

of 05, equation ( 1 ) may be written in the form 

x^m- \x^Bi - \x{7? - 1 )T >2 -- 1 )D 3 -. . . ( 2 ) 

where m = {f{a + mo) -yi(a)}/HAyi[«) + A/i(a - ?r)}, 

- M/MAAa) + A/i(a - w)}, 

Dg = {A»yi(a - w) + - 2w)}/{Ay(a) + ^f(a - w)}, 

and so on. We have now to solve equation (2) bj successive 
approximations. 1 st approximation: a;=m. Substituting this 
value in equation ( 2 ) we obtain the 2 nd approximation: 

x — m — — 1 )D 2 — — 1)D8 — . . . 

This value of x is now substituted jp equation ( 2 ) to form the 
3td approximation for x, and so on for further approxiinations.t 

*1£ Bessers formula should be used. 

fAn excellent method of performing inverse interpolation with a cal¬ 
culating machine is described by L. J. Comrie in the aavXiooX Almanac for 
1937, p. 934. 
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Instead of solving equation (2) by successive approximations we may 
arrange it in the form 

m = x + - lp2 + - 1)^3 + . . . 

We have merely to reverse this series to obtain a formula from which x 
may be found by direct substitution, namely, 

a; = m(l+jD2 + . . •) + »^"( - J l>i + . . .) 

+ ?a3(iDi2-JD 2 -. . .) 

+ . . . 

As an example of inverse interpolation, supposes we wish to find the 
positive root of the equation * 

2)7+2834-480 = 0. 

Writing i/ = 37 +- 480, and finding by a rough graph that the 
coot is a little over 1*9, we construct the following difference table : 


z. 

y- 

A. 

A'-*. 

A”. 

1-90 

-25-71402C1 

11-0886094 



1-91 

- 14-6254167 

11-3179528 

0-2293434 

0-0041112 

1-92 

- 3-3074639 

11-5514074 

0-2334546 

0-0041775 

1-93 

8-2439435 

11-7890395 

0-2376321 


1-94 

20-0329830 





Evidently the root lies between 1-92 and 1-93, and therefore if the root 
be 1-92 + 0-0lx, we have by Stirling’s formula in equation (1): 

0= -3-3074639 + 11-4346801X+0-1167273x2+ 0-0006907(x3-x), 

0 = - 3-3074639 + 11 -4339894x + 0 -1 167273 j 2 + 0-0006907x3 

Dividing throughout by the coefficient of x, 

x= 0-28926595 - 0-0102088x2 - 0-0000604x3 


1st approximation : x = 0-28926595, 

2nd approximation ; x = 0-28926595 — 0-0102088 x 0*083675 

- 0-0000604 X 0-0242 


3rd approximation: 


= 0-28841027, 

x = 0-28926595 - 0-0102088 x 0-0831805 

- 0-0000604 X 0-0240 

» 0-288A 538. 


The required root is 1-9228841533, correctly to 10 decimal places. 

•^This equation was suggested by W. B. Davis {JSd, TimeSf 1867, p. 108) 
but solved otherwise by him. 
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35. The Derivatives of a Function.—From the Gregory- 
Newton formula 

f{a + xw) =f{a) + xHifia) + 

+ +. . . (1) 

we have at once 
/(a + a ac ) -/(«) 
xw 

= ^{AAa) + ^A»y(a)+^^-^A3/](a) + . . .}. 

If X is taken very small so that xw-^0, the left-hand side of 
the equation is of the form {/{^ + h) -J{a)}/K The limiting 
value of this expression when is the derivative of the 
function f(x) for the value a of its argument. We thus 
obtain 

/'(a) = + •..}. ( 2 ) 

The successive derivatives of the function may be obtained 
by the use of the differential calculus in the following way. 
Differentiating (1), we obtain 

w/(a + xw)= A/(a) + A2/(a) + 


4x^ - ISsfi + 22x X 

+-4r.—AVl(a)+. . . 

Also 

«>*/“'(« + aao) = A®/(a) + (»- l)A®/(a) + " Ay(a) +. . . 

and so on for derivatives of higher order. 

Putting CE = 0 in this set of equations, we obtain the results 
uf{a) = AM - JA*y(a) + i A®yKa) - JAyfa) + iA^/{a) 

-^A®/(a) + . . . 

«y"(a) = Ay[a) - A^f{a) + iA®/(a) + i|^A«/(a) -. . . 

«y'"(a) = A»yi(o) - tAy(a) + iAy(a) - ^A^A<^) +. . . 
wy^(a) = A*y(a) - 2A8y(a) + ^^A*Aa) -. . . 
wy^(a)=A«^a)-|A®/(a) + . . . 

«•/■(«)-A*/(o)-.. . / 
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lx,—To find the first and second derivatives of log^ x at x= 500. 


X. 

loge X, 

A. 

A*. 

A*. 

A. 

600 

6*214608 

19803 




610 

6*234411 

19418 

-386 

15 


520 

6*253829 

19048 

-370 

14 

-1 

530 

6*272877 

18692 

-356 

13 

-1 

540 

6*291569 

18349 

-343 



550 

6*309918 





Here w?= 

10 and 






10/(500) = 0*019803+ J(0-000385) + 1(0*000015) 
= 0 * 020001 . 


Also 100/'(500)= - 0*000386 - 0*000015 - f?.(0*000001) 
= -0*000401. 


Neglecting the last figure, which is liable to error, we obtain the results 
/(500) = 0*002000 
/'(500)= -0*0000040. 


We may find tlie formula for the ?ith derivative of a function other¬ 
wise, by using symbolic operators and expanding the function f(a + w) by 
Taylor’s Theorem. 

Thus /(<» + 'M>)=/(a) + w/'(a) + ^/"(“)+" • • (1) 

If yfa denote i the operator for differentiation, by D, equation (1) 
dx 


becomes 


««2n2 7i;3r)3 

/(« + «) = (!+«D + ^ + ^.+. . .)/(<»). 


or (l+A)/(a) = «-”/(«), 

and 1+A = e«’“ (2) 

Taking logarithms of each side of this equation, 
i«D = log. (1 + A) 

= A-JA* + JA»-. . . 

or w/'(o) = A/(a)-JA»/(a) + iA^a)-. . . (3) 

Also w*Da={log^+A)}2 

= (A-iA* + JA»-. . .)*. 

Thtrefore wy"(«) = (A — JA® + JA® -. . 

= A®/(«)-A®/(«) + HA«/(a)+. . 


( 4 ) 
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and in general 

«)»/'“>(o) = (A-+ + . . .yy(a). (6) 

36. The Derivatives of a Function expressed in Terms 
of Differences which are in the same Horizontal Line.— 

By differentiating Stirling’s formula, 

/{a + Tw) =f{a) + {Ayi(rt) + A/(a -w)} + - w) 

+ - 1*) j{A3/i(a -w) + A?J{a - 2w)} 

+ ^>V-U)Ay(ffi-2w) 

+ Tio^i^ - 1*) - 2«’) + Ay(a - 3w)} 

+ Tio**(®* -1*) (** - 2*)A*/(a - Zio), 

the differential coefficients may l»e represented by a rapidly 
converging series in terms of the horizontal differencea Thus 
w/'(a + mv) 

= ^{Af(a) + Af{a - w)} + xAY{n - w) 

+ - l)HA®yi(a -^) + Ay(rt - 2w)} 

+ - 2x)A*f{a - 2iv) 

+ + 4)J{A®/l(<t - 2io) + A^J{a - 3w)} +. .. 

v^/“{a + xw) 

= Ay(rt - w)+ic J{Ay(a -w)+ Ay^a - 2w)} 

+ *(12a;®-2)Ay[a-2««) 

+ Tvv(20a:®-30a;)i{Ay(a-2ip) + Ay(a-3M>)} + . . . 
Putting » = 0 in these equations, we have 

«’/(»)=i{AA«)+A/t« - ■'<’)} - WA^A<^ - w) + A®yi;a - 2w)} 

+ - 2w) + Ayi(a - 3w)} +. . . (1) 

«;*/”(«) =A*y][a-w)-xVAy(«-2w) + vV^y(«-3w) + . • • (2) 

These equations give the value of the derivatives in terms of 
differences which are symmetrical as regards the direction of 
increasing and decreasing aiguments. 

In order to extend these results to derivatives of higher 
order we shall write Stirling’s formula in the central-difference 
notation of § 20 as far as differences of the eighth order. 

/(a + xw) = Mo + »/“8 mo + Ja!*8*Mo - l)ia8%o + 

+ - 1) (** - 4)/*8®«o + - 1) - 4)8% 

+ Tirr5«(»* - 1) (®* - 4) (®* - 9)/t8»Mo 
+ -1) (a^ - 4) (a:* - 9)$%. 
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When the right-hand side is arranged according to ascending 
powers of x, we obtain 

/{a + xw) ^Uq + x{ii.8uq - i 

+ X^{^8^Uq - + T«“0^^O ” T 

+ a5®( J/i,S%0 "" 

+ “ 114 

+ - 062o^®^o ~ ^(3) 

If both sides of this equation are differentiated and we 
substitute the value a; = 0, we obtain the value of wf*{a) as in 
equation (1); and the higher derivatives of f{a) are formed by 
differentiating w/{a + xw), w^f”{a + xw), and so on. 

The successive derivatives of f(a) correct to differences of 
the eighth order are given by the following equations: 

Wf{a) = 

{a) = 

1 

We see that wf'{a) is equal to the coefficient of x in (3) and, 
in general, w'^f^^\a) is equal to the coefficient of x^ in the 
equation (3) multiplied hy n !. This result might have been 
obtained at once by comparing (3) with Taylor’s expansion 
of f(a + xw), 

37. To express the Derivatives of a Function in Terms 
of its Divided Differences.—We shall first find the derivative 
of a function f(x) for the particular value of the argument x 
in terms of its divided differences. As shown at equation (3), 
§ 13, we may write 

=/(®o «i) + (“ - + (“ - “i)(“ - «2ll/(«o-«j.«a “s) 

+ . . .+ («-«!)(«-a,) . . . (w-a»-il/i(ao-®i> • • •><*»»). 

where the divided differences of order beyond the wth are 
supposed negligible. If we Jut u = a^, we have 

fiP'Ci ^o) ^i) (^'0 ” 

• + (^0 - {uq - a2M^o» ^) + • • • 

+ (<1^0 — Of 2 ) (ttQ — ^ 2 ) . . . (ay — . • •> a>n), (1) 
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Bat in § 16 we found that 

f{v) =f{ao) + (« - a„V{a^ a,) + (« - ao)V{<^ 0 ' «o. «o) 

+ (m - ao)!/(ao> ®o> “»> ®o) + • • • 

and bj Taylor’s expansion, 

+ (« - + (« - + (« - + • • • 
so that /'(flo) =/l[ao «o). i/'K) =/K- “o. «o). “ general 

/»)(ao)/»i! =y(ao. % • • •. ®o). 

wliich gives the wth derivative in terms of the divided differ¬ 
ence of the wth order with repeated arguments. 

E(iuation (1) thus becomes 
/K) =,A«o“l) + («0 - + («0 - «l )(«0 - 

+ . . . + {ao-aj} (a„ -a^) . . . \a^- • • •> ®»). (2) 

which gives f'{a^ in terms of its successive divided differences. 
As a special case of this formula when + a2=ao+^^» 

m = ,^A/(»o) + (- + (-«)(- 2«>)3^Ay(a„) +. . . 

or w/'(ao) = A/'(a^-iAayi;ao) + JA®/(oo)-. . . 

which is the formula of § 35. 

A more general expression for the derivatives of a function 
in terms of its divided differences may be obtained from 
Newton’s formula: 

y(a?) =/(ao) + (» - ^i) + - «o) ^ 2 ) 

+ ^ 3 ) +• • • 

Denoting the factor (a? - a„) by on, this equation becomes 

•A®) “yi^o) "*■ ^ 1 ) + ^ 2 ) + ^2» ^s) 

+ . . . + aQa]^a2 • . . an-iy(^0» ^2> • • •> ^n)» (3) 

Differentiating both sides of this equation, we see that 
f{x) aj + (ao + ai)/K, aj, 

+ («o®i ®o®2 "*■ ^l®2]l/(®0> ^a) + • • • (4) 

(®)/2 I =y](^to* ^' 2 ) (®0 + ®^2» 

+ (a^ai + aQttg + + ajOg + + a20^{a^y «!. 03 , ^ 4 ) + . . . 

/'''(«)/3 ! ^s) 

+ (®0 + + ®2 ®1> ^2> ^3> ^ 4 ) + • • • (6) 

^ =/K> rt2» ^ 3 . ^ 4 ) 

+ (uq + ttj + a 2 + 03 + ci4)/(<*o> ® 2 * ^3> >^4> ^s) + • • • ( 6 ) 

and so on. 
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In these equations the coefficient of the divided differences of order r 
is a symmetric function of the quantities a^, a^, ar_i- IJi 

equation (3) this coefficient is of r dimensions, and after each differentia¬ 
tion its dimensions decrease by unity; so we see, therefore, that ilie 
coefficient of /(a^, a^) in the equation for /(^)(a!)/r! is unity 

(i.a zero dimension in o^, . . and all differences of lower 

order vanish. 

If we suppose aQ = aj = a 2 = <*3 = * • • = we obtain the values given 
above : f\a^=f{a^ /''(«o) = «o» «o)» o”* 

Substituting in equation (4) the value sc = ttQ, we obtain equation (2), 
namely, 


W =/(«0» H) + (% “ «2) + (^0 - ^l) (^0 “ «2V(«0> «2I «3) + • * * 

The latter equation is used when the derivative of a single value of 
the function is required; but when the derivatives of several values of 
the function are to be computed, we use equation (4). 


;Ex.—F rom the folloioii^g table of mines compute the third and fourth 
derivatives of f{6) when the argument 6 has the values 5, 14, and 23 
respectively. 

^ 2 4 9 13 16 21 29 

/(0). 57 1345 66340 402052 1118209 4287844 21242820 

We first form a table of divided differences: 


e. 

fW- 




aQ= 2 

57 





644 




«!== 4 

1345 

1765 




12999 


556 


0^= 9 

66340 

7881 


46 


83928 


1186 


agss 13 

402052 

22113 


64 


238719 


2274 


04=16 

1118209 

49401 


89 


633927 


4054 


05=21 

4287844 

114265 




2119372 




05=29 

21242820 




The function is evidently a polynomial of the 5th degree. 


Tabulating the values of Gq, a^, og, . 

. we find 




«-6., 

«o ^ 

^=14. 

0 = 23. 



12 

21 




10 

19 



oa - 4 

5 

14. 


• 

03 - 8 

1 

10 



a, -11 

-2 

7 



1 

1 
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F!rom equation (6) we have at once 

J/"'(^-.666 + (a„ + ai + aj + aj^45 , 

+ (“«“l + «o“2 + “0“3 + “0“4 + “l“2 + “l“3 + “l“4 + “2“3 + “2“4 + “ 3 * 4 )^ 

SO /"'(5) = 1 626, f"\U) = 12 102, /"'(23) = 32 298. 

From equation (6) wc have 

-^''{0) = 46 + (flo + Oi + «2 + Os + a*)!, 

SO /■''(6) = 624, /■''(14)=1704, /"(23) = 2784. 


Examples on Chapter IV 


1. The logs of the numbers 400, 410, 420, 4»30, 440 being given to 
seven places of decimals, find the logs of the inti‘gers between 400 
and 410. 

log 400 = 2*6020600 
log 410 = 2*6127839 
log 420 = 2*6232493 
log 430 = 2*6334685 
log 440 = 2*6434527 

2. If A^’To i» the last non-zero difference of the original sequence, so 
that A’“^iTq=0, A»’‘^*To= 0, . . show that the formulae for sub¬ 
tabulation are : 


A/-*to= ^-,A>-aT„ + P^nDAr-iTo 

f(r-2)0 -m)(l-2m) (r-2)(r-3)(l 

\— 2 :^-+-8^^?-r “• 


The differences of order higher than the rth in the new sequence 
are, of course, all zero. 

3. The following are three consecutive quinquennial sums: 

44133, 41921 ahd 39387. 

* Mouton, an astronomer of Lyons» in 1670 noticed that if in a sequence 
whose rth differences are constant, 8ay=c, intermediate terms are inserted 
corresponding to a division of each interval of the argument into m equal parts, 
then the new sequence has its rth difference constant and equal to c/m**. 
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Find the value of the quantity for the middle year of the second 
quinquennium. 

4. The populations for four consecutive age groups are given by the 
table of values 


Age Group. Populaiwn. 

25 to 29 years (inclusive) 458572 

30 to 34 years ( „ ) 441424 

35 to 39 years ( „ ) 423123 

40 to 44 years ( „ ) 402918 


Estimate the populations of ages between 32 and 33 years, and between 
37 and 38 years respectively. 





then the individual value may l)e found from the groups of t indi¬ 
vidual values Wq, AVi, AVg,. . . and their differences by the formula 


, = ^« + (2® - < + 1 + { 3®* + 3®(1 - 2<) + (1 - 3< + 2«*)} 

t t .2! 


A*W, 

?.3l’ 


where third differences are neglected.* 

'^6, In the following set of data h is the height above sea-level, p the 
barometric pressure. Calculate by a dilfei-ence table the height at which 
jp = 29 and the pressure when /i = 5280. 

0 2753 4763 6942 10593 

p=30 27 26 23 20 

^ 7. Fonn a difference table from the following steam data, where p is 
pressure in lbs. per square inch. 

93-0 96-2 100-0 104-2 108-7 

p 11-38 12-80 14-70 17-07 19-91 

Calculate p when 0=99°-l and determine by inverse interpolation the 
temperature at which 15. 

8. Calculate the real root of t^ie equation 
a;8 + a;-3 = 0 

by inverse interpolation. 

♦ C. H. Forsyth, Quarterly Puhlimtions of the American Statistical Associa- 
iion^ December 1916. 
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9. Find the differential coefficient of log^ as at x s 300, given the table 
of -values 


X. 

log,®. 

300 

6-703782474666 

301 

6-707110264749 

302 

6-710427017376 

303 

6-713732805509 

304 

6-717027701406 

305 

6-720311776607 

306 

6-723685101952 

307 

6-726847747687 

Find from the above table the differential coefficient of log^ i 
10. Given the values 

X, 

V- 

0 

858-313740 096 

1 

869-645772 308 

2 

880-975826 766 

3 

892-303904 583 

4 

903-630006 875 

find the value of when 

dx^ 

x=(i. 

v/ll. Find ^ when 21 = 1 
dz^ 

, given the following values: 

z. 

y- 

1 

0-198669 

2 

0-296620 

3 

0-389418 

4 

0-479425 

5 

0-664642 

6 

0-644217 


12. Apply the central-difference formulae of § 36 to compute the 
first and second derivatives of log^ 304, having given the table of values 
of Ex. 9. 

'/is. From the following data compute the first four derivatives of the 
function y corresponding to the argument a; = 11: 


X. 

y- 

2 

108 243 219 

6 

121 650 628 

9 

1 141 158 164 

13 

163 047 364 

16 

174 900 628 

21 

214 368 884 



CHAPTER V 

DETERMINANTS AND LINEAR EQUATIONS 

38. The Numerical Computation of Determinants.— 

In this chapter we shall consider the problem of finding the 
numerical value of a determinant, say, 

dl 02 02 ^4 ^5 

6 l 62 ^3 ^4 ^6 

Cl C2 C3 C4 6*5 (A) 

6?1 <^2 ^3 ^^4 ^6 

C| 62 C 3 ^4 65 

when the elements ai, 02 , .. . are given numbers. The method 
generally adopted, which is due to Ohio,* is as follows: 

We first notice whether any element is equal to unity; if 
not, we prepare the determinant for our subsequent operations 
by multiplying some row or column by such a number p as 
will make one of the elements unity, and put 1/p as a factor 
outside the determinant. This unit element will henceforth 
be called the pivotal element. Thus in tlie above determinant 
we shall suppose that 63 = 1 and take Iz as the pivotal element. 
We shall show thot the above determinant is equal to the 
determinant 

( - 1)2+® - a^x - « 3^2 - ^364 cfs - 

Cl — ^3^1 C 2 "" C 3&2 ^4 ^3^4 ^5 ^3^6 

di “■ dzbi 6?2 “ C?3^2 ^4 ” ^3^4 ^^5 ^3^6 

Cl ~ C361 C2 - C3&2 C4 - C364 es - C3&5 

♦ F. Cbi6, Mirruyire sur les fonctions connues sous U nom de risuUantes ou de 

Turin (1853). 
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The law of formation of this new determinant (B) may be 
expressed thus: The row and column intersecting in the pivotal 
element of the original determinant, say the Tth row and s^th 
column, are deleted; then evei'y element y is diminished hy the 
product of the elements which stand where the eliminated row 
and column are met hy perpendiculars from y, and the whole 
determinant is multiplied hy (- !)*■+*. 

The advantage gained by substituting the determinant (B) 
for the determinant (A) is that {B) is of order one unit lower 
than A ; and therefore by repeated application of this method 
of reduction we can reduce any determinant to the second order, 
when its value may be written down at once. 

To prove this theorem, we first divide the columns of the 
determinant (A) by hi, h ^,. . 65 respectively, so that it takes 

the form 



62 ^5 

Ol 

02 

^3 

a* 

«6 




h 

h2 

hi 

hi 




1 

1 

1 

1 

1 




Cl 


C 3 

C4 

Cb 





h 

hi 

h 




di 


j 

di 

db 




hi 

hz 

dz 

hi 

h 




Cl 

62 

ez 

«4 

Cb 




hi 

62 

hi 

hi, 


since h^ = 

1 : then (subtractin 

g the elements of the third column 

from those in the other 

columns) we write the determinant in 

the form 








hi h% 64 &5 





«4 


% 



. 

hi' 

^3 

hi 

-«3 

r--(lZ 

h 



0 

0 


1 

0 


0 




Ci 

-C 3 

Cs 

C4 

-C 3 

Cb 




hi 

hi 




^ d 

hi ^ 

d^ 

62 

-rfa 

d^ 

di 

hi 

-^73 

^-di 

05 






ez 

C4 


Cb 




w 

hi 

-^3 

h 
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This determinant may now be written 






«4 ^ 

fits 









fis 


r* "^3 

T 

^2 




di j 

r — rta 

h 


^4 7 

^5 7 

Fs-'^ 




^4 

^6 




Fr^ 



which is obviously equivalent to the form (B). 

It is usually advisable to prepare tlie determinant for computation by 
forming zero elements in the row and column containing the pivotal 
eletnent. For example, in the determinant (A) of Ex. 1 below, zero 
elements may be introduced into the first row by adding three times the 
third column to the first column to form the new first column; then 
adding the third column to the second column to form the new second 
column, and so on. In this way the subsequent calculations are simplified. 

In performing the computation of a determinant it will be found 
convenient to draw pencil lines through the row and column which 
intersect in the pivotal element; this helps the eye in finding the 
elements at the feet of the perpendicular. 

Instead of first dividing some row or column in older to obtain a 
pivotal element equal to unity, we may eliminate the row and column 
intersecting in any element (not necessarily unity) as follows ; Delete the 
row and column in question by drawing pencil lines through them ; then 
the rule is 


, ,, , product of elements at feet of perpendiculars 

New element = old element — j ~ i —;;--> 

element at intersection of pencil lines 

the new determinant being multiplied by the element at the intersection 
of the pencil lines and by the factor ( — l)^'*'*, where r and s denote the 
numbers of the deleted row and column respectively. 


The above method of computing the value of a ileterminant 
enables us at the same time to compute the co-factors of the 
elements corresponding to the surviving elements; for these 
co-factors are actually equal to the co-factors of the correspond¬ 
ing elements in the reduced determinant. 

For example, the co-factor of Cj in determinant (A) is 

^4 ®6 

^8 ^4 ^6 

^2 ^3 dj 

«2 ^ ^4 
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which, since & 3 = 1 , is equal to 

^2 — <*362 «4 — <* 3^4 «5 — 0365 

^2 “■ ^^4 ■“ ^ 8^4 ^6 ■" ^ 3^5 

H ^4 ■“ ^ 8^^4 ^6 ^^^5 

the latter determinant being the co-factor of (cj —C 3 & 1 ), which is the 
element in the reduced determinant (B) corresponding to in (A). 

Ex, 1 .—Evaluate 


3 1-1 

-2 3 1 

14 2 

6 -2 -3 

-1 1 2 


2 1 

4 3 

3 1 

5 -1 

3 2 


(A) 


We may select as pivotal element the number 1 at the intersection 
of the first row and the fifth column; the rule then gives 


3 

1 

- 1 

2 

1 

= (-!)* 

-11 

0 

4 

-2 

-2 

3 

1 

4 

3 

- 2 

3 

3 

1 

1 

4 

2 

3 

1 


8 

-1 

-4 

7 

6 

-2 

-3 

6 

-1 


- 7 

-1 

4 

-1 

- 1 

1 

2 

3 

2 







Taking as new pivotal element the unit at the intersection of the 
second row and last column, we obtain 


(- 1 ) 8+6 


-16 
22 
- 9 



10 

-25 

7 


(C) 


As there is now no unit element in this determinant, we may 
divide the second column by 2 and so fonii a unit pivotal element. The 
determinant becomes 



-16 3; 10 

22 - 11 : -26 
- 9 . 1. 7 

2 (-1)5[ 12 -11 I 

1-77 62 I 

446, the required value. 


(D) 


If it were required to determine the co-factor of (say) the element 
in the fourth row and first column of the above determinant, we 
should have the co-factor of 6 in (A) equal to the co-factor of 8 , the 
corresponding element in the reduced determinant (B) 

=a the co-factor of 22 in the next reduced determinant (C) 

= the co-factor of — 77 in (D) 

« - 22 . 


Ex, 2 .—Evaluate the determinant for the Legendre polynomial of order 
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P6(*) = 


2 a 1 0 

“ 61 1 3a 2 

0 2 5a 

0 0 3 

0 0 0 

We eliminate the fourtli row and Jiftli 
element at the intersection of these lines 
determinant. Thus 

61 P 5 (s) = 


(eliminating the 1st row and 2nd col.) = 


(eliminating the 1st row and 2nd col.)=s — 8 


0 
0 

3 
7a 

4 

column, 

hecomes 


0 

0 

0 

4 

9a 


noting that the 
a factor of the 


-4 

s 1 

0 


0 


1 3;i 

2 


0 


0 2 

5a; 


3 


0 0 

27;^ 

"" V 

4 

C32;2 

'T 

4 

1-32j2 

2 


0 


- 22 ; 

5z 


3 



27z 


G3z^ 


u 

“ “4 



-8 


9z 


•j 


2 “ 

J 


0 


81k'’ 

--g- + 

27z 

IT 

4 

63x2 

4 


= 945a5-1050a3 + 225a. 

So we have = J(63a® - 70a® + 15a). 

39. The Solution of a System of Linear Equations.— 

Being now in a position to compute the numerical value of a 
determinant, we can solve a set of linear equations in any 
number of unknowns, Xi, X 2 »Xz,. . Xn, say 

' ^11 a?i + «i2 a:2 +. . . + Xn = Ci ' 

^21 ^1 ■+* ^2 3?2 " i " • • • + ~ ^2 

Xy + ttji2 3^2 • • • + (^nn ~ 


by the formulae which are proved in works on determinants, 
namely: 




^12 

^13 • • 

, . dy^ 


an 

Cl 

ai 3 . 

. . 


C2 

^<22 

^23 . • 

. Ozn 


^21 

Cz 

023 • 

. . dzn 





• • ^nn 

,»2 = 

a,n 

Cn_ 

3}^ : 

• • Ct-nn 

ttii 

ai2 

ayz . . 


dll 

«12 

013 . . 

, . Oi^ 1 


^21 

C^22 

< 3^23 . • 

. (Izn 


dzi 

1 

023 • • 

. d2n j 

. . i 


®nl 

a.n2 

d nZ • 

, • d^in 


fhil 

^n 2 

On 3 . . 

. • dnn 


and similar expressions for Xs, X 4 ,. . ., 

* Further remarks and examples on the solution of linear equations will be 
found in Chapter IX. in connection with the solution of the “ normal equations" 
in the Method of Least Squares. 
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Ex,—Fmd the values of w that satisfy the system of equations: 

Qx + ^y + 4z + 2w^2S' 

3a;+62-4w= 12 

y+ was 6 

6a;— y+3wasl9^ 

We have at once 

28 3 4 2 

- 12 0 6 -4 
5 111 

19 -1 0 3 

9~ 3 “ 1 ““2 

3 0 6 -4 

0 111 

6-103 

28 4 2 

12 5 -4 414 

5 1 1 ““207““ • 

19 0 3 

and in the same way we obtain the values »== 4, wa= 3. 


Examples on Chapter V 


Evaluate the determinants 


2 

4 

3 

7 

0 

(i8) 

1 

3 

0 

0 

0 1 

2 

3 

5 

5 

3 


-1 

2 

3 

0 

0 1 

1 

0 

2 

1 

2 


0 

-1 

2 

3 

0 

3 

0 

1 

0 

6 


0 

0 

-1 

2 

3 ; 

2 

1 

2 

4 

1 


0 

0 

0 

-1 

2 1 



( 7 ) 

2 

1 

0 

0 

0 







3 


1 

0 

0 







4 

1 

2 

1 

0 







6 


3 


1 







6 

1 

4 

4 

2 





2. Show that the determinant 

2 1 0 0 0 0 

22 5 1 0 0 0 

0 32 7 1 0 0 

0 0 42 9 1 0 

0 0 0 52 11 1 


with (n — rows, is equal to nL 


9 

1 3 
207 0 
6 
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3. Compute the values of 


1 

1 

0 

0 

(/8) 

3 

7 

1 

2 

51 

\ 

i 

1 

0 


6 

4 

3 

0 

2 

\ 


i 

1 


0 

3 

0 

1 

2 

Y 

1 

i 



1 

0 

6 

5 

3 






2 

1 

0 

2 

0 


(y) 1 2 3 4 5 6 

2 3 4 5 6 1 

3 4 5 6 1 2 

4 5 6 1 2 3 

5 6 1 2 3 4 

6 1 2 3 4 5 

4. Verify the relation 

2"= 1 1 0 0 0, (w+l)rows, 

-11100. 
112 10, 

-113 3 1. 

1 1 4 6 4 . 

the constituents being binomial coefficients, wlien n = 2, 3, 4, • , 

5. Solve determinantally the following sets of equations : 

(a) fl! + 2y-2» =3^' 

3a;+ y + 2a(;=:4p 

a- iu = 6g 
jc- 2 - «; = 0 

(fi) *,+ 6*3 - 2*4+ a^= 

4.^2— 33/0 “f" 

- 9a:i+ 83'/2+103)4= 

2xi + 203)2 - 13x0 + 3)g= 

bx ^ + 43)0 + 3^C0 + 23)4 + 43)^= 





CHAPTEE VI 

THE NUMERICAL SOLUTION OF ALGEBRAIC AND 
TRANSCENDENTAL EQUATIONS 

40. Introduction. —In tho present chapter we shall show 
how to find the value of an unknown quantity which satisfies 
some given algebraical or transcendental equation; or the 
values of several unknown quantities which satisfy a set of 
given equations^ equal in number to the number of the 
unknowns. 

The methods in use may be classified as follows: 

(a) Literal methods, in which the solution is obtained as a 
general formula^ so that nothing remains but to substitute 
numerical values in the formula; as, for example, the solution 
of the quadratic equation + 2 lx + c = 0 by the formula 

55 = - h±'\/W^. 

These literal solutions are valuable when they can be 
obtained,* but in most of the cases we shall have to discuss, 
they are unattainable, at any rate in a form involving only 
a finite number of arithmetical operations. 

{P) Numerical or computer's methods, in which the working 
is mainly arithmetical from the beginning. These are, on the 
whole, the most useful, particularly when a high degree of 
accuracy is required, and they constitute the main topic of 
the present chapter. 

(y) Graphical methods, in which the solution is obtained 
by drawing diagrams. These are much used when a rough 

* The solution of tho cubic discussed in § 62 belou*, and the general formula 
proved in § 60, are examples of literal solutions. 

78 
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solution is all that is required or as a preparation for a more 
ac5curate solution by numerical methods, but for many purposes 
they have been superseded by the 

(8) Nomographic methods, in which a diagram is prepared 
once for all to serve for a wide class of cases, so that it may 
be used over and over again with different numerical data. 

(c) Mechanical methods, in which some mechanical arrange¬ 
ment is applied; many ingenious machines have been de¬ 
vised for the purpose of solving different equations, but on 
account of their cost and complexity they have not come into 
extensive use. 

41. The Pre-Newtonian Period. —A method for the ex¬ 
traction of the square and cube roots of numbers, digit by digit, 
was discovered by the Hindu mathematicians, and by them 
communicated to the Arabs, who transmitted it to Europe. 

This method was extended by Vieta in 1600* so as to 
furnish the roots of algebraic equations in general. The 
process was so laborious that a seventeenth-century mathe¬ 
matician described it as '^work unfit for a Christian,” f but 
it was in general use from 1600 to 1680. 

In 1674 a method depending on a new principle, the 
principle of iteration, was communicated in a letter from 
Gregory to Collins; t and independently, a few months later, 
in a letter from Michael Dary to Newton. § This principle we 
shall now discuss. 

42. The Principle of Iteration. —As a first illustration 
of the principle of iteration we shall consider an algorithm 
suggested by Newton 1| for the determination of square roots, 
which may be described as follows: 

Let N he the number whose square root is required. Take 
any number and from it form X\ according to the equation 
a?i = J(a!;o +N/a^). From Xi form X2 according to the equation 

= + From X2 form, x^ according to the equation 

VI 

• De numerosA potestatum adfectarum resolutiomej 1600. 

t Warner in Rigaud’s Correspondence of Scientific Men of the 17th Centuryt 

1, p. 248. 

t Rigaiid’s Correspondence, 2 , p. 255. 

§ Rigand, op, eit, 2 , p. 365. For an account of Dary see Rigaud, 1, p. 204. 

II Op, eit, 2, p. 372*. 
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X3 = ^{xt + N(Xt), and so on. Then the sequence of numbers 
Xo, Xi, Xi,Xi, . . tends to a limit which is J'N. 

Thus, taking N<s 10 and Xq= 1, we have 
»i-i(l + 10) = 6-6, 

ae* = 1(6-6 + 10/6-6) = J(6-5 + 1-8) = 3-7, 

*3=1(3-7 + 10/3-7) = i(3-70 + 2-7) = 3-2, 

»4“|(3-2 + 10/3-2) = 1(3-2 + 3-125) = 3-163, 

*6= 1(3-163 + 10/3-163) = ^(3-163 + 3-161565) = 3-1622776, 
*^-1(3-1622775+ 10/3-1622776)= J(3-1622776 +3-1622778) 

= 3-1622777, 

which is the square root of 10, correctly to seven decimal places. 

In order to prove the validity of the process, we proceed as 
follows; 


The equation 

II 

-i + N/ajp-i) 

may be written 

Xp — JTS _ i 

<Xp.i- VN\* 


Xp+ J'S 

'va^_i+ VN/ ’ 

whence we have 


(Xa- 


»„+ ja 

U+ VN/ 


From this equation it is evident that if 

|< 1, then Lt„^„ar„ = ; (1) 

then LW„®„= - JS. (2) 

The limiting case is when 

I Xo - J'N I = 1 ®b + JIS |. 

If we write Xq = re^^, N = Xe^®, 

this becomes 

I r cos 0 - X* cos Jo + ir sin 0 - tX^ sin Jo | 

= I r cos ^ + X^ cos Jo + ir sin 0 + iX* sin Jo |, 
or, squaring both sides, 

+ X - 2rX* cos (0 - Ja) = + X + 2rXi cos (<9 - Jo), 

so cos {0 - Jo) = 0, 

or 0=±9O® + Ja, 

which is the equation of a straight line through the origin in 

* If « is 4 complex number, say equal to where u and v are real 

numbers, then tj is denoted by |e| and is called the modulus of z. 
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the plane of the complex variable Xo, perpendicular to the line 
joining the points JN and - JN. Denoting this line by I, 
we regard it as dividing the plane into two half-planes; and 
we see from (1) and (2) above that ^Ae algorithm leads to the 
valvje ^Ny as long as the initial number Xq is taken in that 
half of the plane which contains JN ; and the algorithm leads 
to the value - JN, as long as Xq is taken in that half-plane 
which contains - JN. If x^ is taken exactly on the line Z, the 
sequence x^y sci, x^t . . does not tend to a limit.* 

A pleasing characteristic of iterative processes may be observed in 
connection with this example, namely, that a mistake in the performance 
of the numerical work does not invalidate the whole calculation. If, for 
example, a mistake were made in calculating from the erroneous 
value obtained might have been obtained correctly by starting from 
a diflferent value ; and since Xq is to be taken arbitrarily, the true 
solution may be reached by way of as well as by way of x^. The 
correct result is obtained whenever the numbers Xr+v air+2» • • •> 
obviously tending to a limit, however many errors may have been com¬ 
mitted in obtaining these numbers. This valuable feature of iterative 
methods has made them very popular. 

43. Geometrical Interpretation of Iteration. —The nature 
of iterative methods may readily be illustrated f geometrically. 
Let f(x) = 0 be the equation. Write it in the form fi(x) =/ 2 (a?), 
as may usually be done in many ways \ thus, if the equation is 
3x^ - 8x + a = 0, we can take fi{x) = 3x^, / 2 (a) = 8x - a. Draw 
the curves y =/i(x) and y =/ 2 (a?); the real roots of f{x) = 0 are 
evidently the abscissae of the points of intersection of these 
two curves. An iterative process for finding them may be 
devised as follows: select any point Xo on the axis of x so that 
the value of Xq is nearly equal to that of the abscissa of one of 
the points of intersection of the curves. From Xq draw a 
straight line parallel to the axis of y until it meets the curve 
which has the slope of lesser magnitude. Suppose, for example, 
when x — Xo that |/i'(x)|<|/ 2 '(®)| and that the line x = a!;o meets 
the curve y = /i(x) at the point {xo, yo)- From this second point 
draw a line parallel to the akis of x until, it meets y In 

the point (xi, y^. From the third point draw a line parallel to 

* For further work on iterative solutions of quadratic equations and their 
connection with geometry cf. 0. Nicoletti, Rend, di PaZermo, 42 (1917), p. 73. 

t Cf. R. Ross, NaJLwrey 78 (1908), p. 663. 

(0 811 ) 4 
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the axis of y until it meets the curve /i(®) in (* 1 . yi)» 
from this fourth point a line parallel to the axis of x, and so on. 



* X-Xo 

Fio. 6. I’lQ. 7 . 


Then the abscissa of the first and second points is Xo, that of the 
third and fourth points is and in general !r» approaches nearer 



Fio. 

to the point of intersection of the two curves for increasing 
values of n ; i.e,, Xn converges to a root of the original equation. 
There are two main types of diagram resulting from this 
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process according as the slopes of the two curves have the same 
or different signs for the abscissae Xo» a?i, X 2 , • . . 

In Fig. 6 the alwcissae . . . are all on the same side of 

the root x and the lines approach the point of intersection of the curves 
in the form of a “ staircase.” In Fig. 7 the lines approach the inter¬ 
section spirally. The staircase solution is obtained when the derivatives 
of the curves f^ix) have the same sign near the point of inter¬ 
section and the spiral solution occurs when those derivatives have 
opposite signs. 

Ex, 1 .—To find the real roots of the equation 
a;®-a;-0*2 = 0. 

The real roots are the three intersections of the curves y = sc® and 
y=afl! + 0*2 as shown in Fig. 8. 

We can iterate to each of the three roots as follows: 

For the positive root 


x^yl 

i/ = 0-2 -f aj 

Xq= 1-000 

1-037 

Og* 1-0434 
3-^=1-0445 
a;4= 1-04472 
(The root is a; = 

yo= 1-200 

2/2= 1-2434 
?J3= 1-2446 

2/4= 1-24472 
= 1-0447616.) 

Fop'lhe larger negative root 

'A .., *=# 

y = 02--\-x 

f - 1-000 

a;-l= — 0-956 

0^2= -0-9456 
a-3= -0-9430 

3:4= -0-9423 
-0-94214 
x^= -0-94210 

(The root is x 

y„= — 0-800 
-0-766 
y,= -0-7466 

2,3= -0-7430 

3^4=-0-7423 

1,5= -0-74214 

= - 0-94209.) 

For the smaller negative root 

2 c = ?/-0-2 

y = sfi 

a;^ =-0-000 j 

-0-200 

a;2=-0-20032 

1 / 3 = — b«oo6 

~ 0-00032 
2/2= -0-0003226 


(Correctly to five places.) 


Eqr, 2.— Find, correctly to five decimal places, the root of the equation 
y + logi(yy=o-5 

ly iterating the formula yp+i = 0-5 - 
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By inspection of a log table we see that ^ approximate 

value of the root So we take 

yi = 0-6 - logiQO‘68 = 0*6675, 
t/2== 0*5 - logij^0*6676 = 0-6756. 

As the iteration evidently furnishes values alternately less and greater 
than the root, we take (^1 + 2 / 2 )/^ 0-6716 as the next approximation 

^8, and then we have 

=0-5-logioO-6716 = 0-6729. 

Take ^5 = (y^ + yJl2 = 0-6723, 

y^ = 0-5-logi80-6723 = 0-672437, 
y^ = 0-5-logioO-672437 = 0-672347. 

Take y^ = (y^ + y^)l2 = 0.672392, 

y^ = 0-6-logi80-672392 = 0-672377, 

the mean of the last two values = 0-672386, 

3/u = 0-5 - logioO-672385 = 0-672382. 

The root, correctly to five decimal places, is 0-67238. 


Ex, 3 .—Find {using only Barloufs table of cubes) the smaller ^positive 
root of 

a® - 2a; + 0-5 = 0. 


44. The Newton-Raphson Method.—It is evident that in 
iterating towards any root we are not bound to proceed by 



rectangular steps, as we have dond in the preceding article; we 
might just as well have proceeded by oblique steps, as in the 
following method: 

Let QiP be an arc of a curve y -f{x), intersecting the axis 
of X at A, so that the abscissa of A is a root of the equation 



ALGEBEAIO & TEANSOENDENTAL EQUATIONS 85' 

f{x) = 0. Suppose that the arc AP is convex to the axis of a?, 
and that P is a point on this arc with abscissa Xq. At P draw 
a tangent to the curve meeting the axis of a: in M and let 
OM =Xi. Let Q be the point on the curve whose abscissa is Xi, 
and at Q draw a tangent to meet the axis of a? in N; write 
ON = X 2 - Let E be the point on the curve whose abscissa is a^, 
and similarly at E draw a tangent to the curve to meet the axis 
of X, It is evident that the points L, M, N,. . tend to A, or, 
in other words, the values aro, Xi, X 2 , ajs, . . ., form a sequence 
tending to the root of the equation f(x) =0. If, however, we 
start on the other side of A, where the curve is concave to the 
axis of a?, at Qi say, the first step of this method carries us 
to the other side of A, where the arc of the curve is convex to 
the axis of x, after which the sequence tends to the root as 
before. 

Now we have 

Xq-Xi^ ml « LP cot PML =/(a^o)//(ai)), 
so Xi^Xo--/{Xo)lf{Xo), 

and in general Xr+i = Xr -f{xr)lf(xr). (1) 

The process is therefore an iteration based on the equation (1). 
In substantially this form it was given by Eaph8on*in 1690; 
but the method is commonly called Kewton’s, because Newton 
had previously f suggested a nearly related process.}: 

The preceding discussion is really based on two assumptions : 

1. That the slope of the curve does not become zero along the arc 
QjP; i.e, that the equation /'(«) = 0 has no root between and irQ, the 
abscissae of and P. 

2. That the curve has no point of inflexion along Q^P. 

The rule of Newton becomes more precise if we make use of the 
observation that we can determine which of the two abscissae and Xq 
corresponds to the part of the curve which is convex towards the a:-axis 
from the condition that at points where the curve y^f(x) is convex 
towards the axis of x, we have the relation 

/(»r(*)>o. 

* Analysis Aeguationvm Universalis^ London (1690). 

t Wallis’ Algebra (1686), p. 338. 

X The difference between Newton’s process and Baphson’s is that Newton 
calculated a set of successive equations, whose roots were the successive residuals 
between the above quantities Xn and the true value of the root, whereas in 
Raphson’s form of the process this is unnecessary. 
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Hence we see that if /(sc) has only one root between two hounds xf and 
Xq, while f (a) and, f\x) are never zero between these bounds^ then the Newton- 
Raphson process will certainly succeed if it be begun at that one of the 
bounds for which /(x) and f\x) have the same sign. 

We might have derived Newton^s method by the aid of Taylor^s 
Theorem as follows: 

Let scq be an approximate value of a root of the equation f(x) =* 0. 
Put osssscq+P, where p is small. Then by Taylor’s Theorem 

0 “/(flCo+P) “/(*b) + Pfi^o) + involving higher powers of p ; 
so approximately we have 

and therefore 

which is Newton’s formula.* 


Ex, 1 .—Let the equation be 

sc® — 2x — 5 = 0.*j’ 

Here it is obvious that an approximate value of the root is 2. 


Taking 2, we have 


/( 2 ) 


Next 




-/( 2 )= 2 + 10 - 2 - 1 - 


0-061 

•1 - rrS 5 “ 2-1 -0-0064 (nearly) = 2-0946. 


Now 

so 


/(2-0946) 0-0006415606 

11-16205 


0-000048617 


X3= 2-094600000 - -000048517 
» 2-094561483. 


The required root is 2-09466148 correctly to the fimt nine digits. We 
postpone for the present (cf. § 50) the answer to the question, how we 
know the number of places to which our result is correct. 

Ex, 2 .—Find correctly to four decimal places the greatest root of 
sc* - 4x* - X + 3 sa 0. 


* On the formulation of conditions under which Newton's method of ap¬ 
proximation leads to a root of an equation cf. Cauchy, (EuvreSy Scr. 2, 4, p. 573 ; 
a. Faber, Journ, fUr Math, 188 (1910), p. 1. 

t ** The reason I call sc’~2x-6=0a cell^brated equation is because it was the 
one on which Wallis chanced to exhibit Newton's method when he first published 
it, in consequence of which every numerical solver has felt bound in duty to 
make it one of his examples. Invent a numerical method, neglect to show how 
it works on this equation, and you are a pilgrim who does not come in at the 
little wicket {vide J. Bunyan)," [de Morgan to Whewell, 20th January 1861]. 
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Ex, 3.— Find by NewtoiCs method^ correctly to six places of decimals^ the 
root of the equation 

X = 4*7772393. 

From a table of logarithms we have, by inspection, the values 
6 log^^e = 4*67 and 7 logn,7 = 5*92, 

80 we take as a first approximation 6. 

The next approximation is 

a?! = “/(^o)// (^o)* 

Now /(a;o) = (6x0*77815)-4*7772= -0*1083, 

/'(a^ = logio^o + = 0*778 + 0*434 = 1*212, 

so l//(«o) = 0*825 (nearly), 

and 0 ^ = 6*089. 

The next approximation is 

^ ^ -/(fl5j)//(2Ci). 

Now f{xf) = (6*089 X 0*7845460) - 4*7772393 = - 0*0001387, 
so 6*089 + (0*0001387 X 0*825), 

or aj 2 = 6*089114, which is correct to six places. 

45 . An Alternative Procedure. —Instead of following 
Newton’s rule strictly by forming /(ccb) and /'(®b)> etc., we may 
proceed in a somewhat less elaborate way as follows: 

Suppose we have found (graphically or otherwise) a first 
approximation to the root of the equation /(a?) = 0, which we 
will call ai. Let x denote the required root of the equation 
and put 

a? = ai + 8, (1) 

where 8 is a small quantity. We now substitute this value of 
X in the original equation, neglecting powers of 8 greater than 
the first. We solve the simple equation in 8 so formed and 
denote the value obtained for 8 by 8i. Then the second 
approximation to the root x is ai + 8i. Now denote ai + 8i 
by 02, and write 

a? = 02 + 8, (2) 

and substitute this value oUx in the original equation. Pro¬ 
ceeding as before, we find an approximate value of the 8 of 
equation (2); let it be 82, so 02 + 82 is a third approximation to 
the root; then we denote 02 + 82 by 03, and write 

a? “ 03 -f 8, 


( 3 ) 
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and similarly for further operations. The sequence ai, 02 ,.. 
converges to the root x. This procedure is essentially equivalent 
to the Newton-Baphson process, as is evident from the 
connection pointed out above between the Newton-Baphson 
process and Taylor's Theorem (§ 44). 

Ex. — Find the root of the equation 

10a;-a* = 3-1462644, 

which is near 0*3. 

If a; *a + 5 where 8 is small, we have approximately 
3-1462644-lOai + tti^ 

10-3ai2 

Put ai = 0*3, then 

0-17326 

8 = -gT 7 y- = 0-17326 X 0.10277 5 = 0• 017806. 

The next approximation is etg —from which we have 

3-1462644-3-178-h0*0320968_0-0003612 
10-3x0*101 9-697 

= 0-000037,249, 

so 03 = 0-317 837,249. 

The required root of the equation is 0*3178372, correctly to seven places. 

It may be remarked that the above method is theoretically applicable 
to complex as well as to real roots, but in the case of complex roots the 
numerical calculations are generally so laborious that other methods to 
be described later are preferable. 

46. Solution of Simultaneous Equations. —As a further illustra¬ 
tion of the method of the last section we shall find a solution of the 
simultaneous equations, 

( 1 ) 

+ a;2 - 2xy = 4. (2) 

We first trace the curves represented by these two equations. In the 
firet equation we find, corresponding to given values of aj, the following 
values of y : 


X 0 

+ 1 

>1 -1 

-2 

-3 

- 10-0 

y tO-71 

0 

imaginary if 1 

qf:2-12 

q:3-74 

t22-4, 

and in the second equation similarly 




y 0 

0-6 

+ 1 > + l 

-1 

-2 

-3 

X -2 

-1-4 

+ 1 imaginary 

-4-16 

-8*93 

-15 

+ 2 

-f-2-4 


+ 2-16 

+ 4-93 

+ 9. 


There is evidently only one real root of the equations, represented by 
the point G in the diagram. From the diagram we see that the ordinate 
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of G lies between + 0*7 and + 0'9. Introducing the three values y = 0*7, 
y = 0*8, y = 0*9, into equations (1) and (2), we obtain 


2/* 

X by (1). 

^ X by (2). 

Difference, 

0*7 

+ 0*27 

- 0*96 

+ 1*22 

0*8 

-0*65 

-0*64 

-0*01 

0*9 

-0*85 

-0*18 

- 0*67, 


so if in the two curves x were to vary proportionally to y we should 
deduce that y 'would be near 0*799. If we now try the three values 



0*‘}97, 0*798, and 0*799, this time using logs to 
we obtain 

five places of decimals, 

?/• 

« hy (1). 

X by (2). 

Difference. 

0-797 

-0-6467 

-0*6534 

+ 0*0067 

0-798 

-0-6492 

- 0*6498 

+ 0*0006 

0-799 

-0-6617 

- 0*6460 

- 0*0057, 


from which it is seen that the true values are nearly 
a;=-0*6494 and y= +0*7981. 


We therefore substitute in equations (1) and (2) 

a;=-0*6494 + 8^ and ?y =+0*7981 + 8y, 

neglecting squares of 8x and 8y, Using units of the third decimal 
place, we have 

126582;+31928y =-0*0621, 

28958a; - 10863Sy « 0-0969, 

(d8U) 4« 
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-0-0160, %= -0-0131, 

BO «=•-0-649416, H-0-798087, 

which is the required solution. 

47. Solution of a Fair of Equations in two TTnknowns hy 
Nowton’s Mothod. — A. more formal and general treatment of the 
topic of the last section is the following. Let two equations be given, 

A*.y)“0, fl<a^i/)=0, 

from which the unknowns (*, y) are to be determined. 

Let (o^j, be an approximate solution of tbe equations. Write 

as-a^-t-A, 

Then by Taylor’s Theorem, neglecting powers of h and k above the first, 
we have 


yo)+^S:+*^’ 


giving 


h= 


%o m 


^ ^ . 
3%^ 


3»o 




‘'arp 


M. ^ 

Therefore an improved pair of values for the roots is 

^Tlu *71,1 


3 ^ 3yo 


3^0 aib 


yi“yo+ 




^ 3»o ^0 3*0 

These formulae may be iterated as in Newton’s process for equations 
in one variable. 


48. A Modification of the Newton-Raphson Method.— 
The computations required for the Newton-Baphson method 
may be simplified in the following way. 
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Instead of 

®r+l = (^)> 

we may take as the formula on which iteration is based 

Xr+i = Xf 

This means that in the successive steps of the process of 
§ 44, we replace the tangents at Q, B, . . . by lines parallel to 
the tangent at P. By this method we are saved the trouble of 
calculating f[x^ at each stage, while the number of approxi¬ 
mations required is practically no greater than in the Newton- 
Baphsoh method. 

Ex, 1. —To find the root of the equation 

f(x) = .X® + 4a^ — + lOx^ — 2a; — 962 = 0 

which lies between 3 and 4. 

Here/(3)*= - 365, /(4)= 1110, so by proportional parte we may take 
scqb: 3*3 as a first approximation to the root 
The next approximation is 

*i=*o-/(*o)//(*o)- 

Now 

/{xq) = - 65*85, /(xq) = 5x 4 + 16a^ - 6 x 2 + 20a; - 2 = 1166*6, 
and llf(xQ) = 0-000Sb1 (nearly). 

Therefore = 3*3 + (65*8 x 0*000857) 

= 3*356. 

The next approximation is 

Instead of calculating /'(x^), we may use again. 

Now 

05^ + 4= 7*356, 

a;j2 + 4xi-2== 22*6867, 
x^ + 4x^^^Xj^+\0= 86*137, 

Xi* + Ax^ - 2x^2 + 1 Oxi - 2 = 287*08, 
x^ + + 10xi2 - 2xi - 962 = 1 *44, 

and therefore /(xi)=l*44. 

We have at once 

a;2 = xi-(1*44x0*000857) 

= 3*3660-0*0012 
= 3*3648. 

This is coiTect as far as it gSes, the value of the root to seven places 
of decimeds being 3*3548487. 

J&x. 2. —Compute the root of the equation 

X+logygfli=:^0-5 

correctly to five places of decimals. 
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49 . The Rule of False Position.— Another iteration pro* 
cess belonging to the same class as Newton’s for finding the 
root of an equation is the following. 

Let f{x) = 0 he the given equation. We find (by trial or 
otherwise) two values a and I near the root of the equation 
such that /{a) and fijb) are opposite in sign. Let the arc 
CRD in the diagram denote the curve the abscissa 

of R being the root of the given equation, and suppose that 
the equations /'(») = 0, f{x) — 0 have no root between a and 


o 



I, the abscissae of C and D. The curve is therefore constantly 
concave to the axis of x along one of the arcs CR and Rl), 
and constantly convex to the axis of x along the other of 
these arcs. Let RD be the arc which is convex to the ir-axis. 

The equation of the chord CD is given by 


and the abscissa a' of the point where the chord cuts the 
axis of X, is given by 




(6-a)/(a) 


( 1 ; 


/(«»)-/(4 

Then o' is evidently a doser approximation to the root of the 
equation than a. 
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We now draw the ordinate at A' to cut the curve in C' and, 
as before, we draw the chord C'D intersecting the a; axis in a 
point A" which lies between A' and B. The abscissa a" of the 
point A" is evidently a closer approximation to the value of 
the root than a\ and its value is 


a = a 


{b-ay{a') 


( 2 ) 


and so on for further approximations to the root of the 
equation. 

It is evident that the computation of the root may be simplified if in 
equation (2) we replace h by 6', the abscissa of any point on the curve 
between K and D, as in Ex. 1 below. 


Equation (1) may also be written in the form 


a 


“/(<•)-/(«)■ 


The iterative process based on this equation is known as the 
rule of false position. 

The rule of false position is essentially inverse interpolation 
(§ 34) when differences above the first are neglected. The 
above iteration to the root is valid even when the initial points 
0, D are on the same side of the root B, provided the arc 
RD is convex to the a?-axis. 


Ex. 1 .—To solve the equation 

a? — 2aj — 5 = 0. 


Here we can take a = 2, 6=3, and 


%.e.^ a = 2*00. 

We now take 


(3~2y(2) 

/(3)-/(2) 



a* = 2*06, 6'= 2*10, then 


(2-10-2-06) (-0-378184) 
“"b^l000 + 0-378184 


2-0944 . 


•» 


i.e., a'= 2-094. 

Instead of taking 6'= 2-10 |bs before, we may take 6'= 2-096 since 
/(2-096) is positive. We have 


0-002 (-0-006153416) 
a" = 2-094 - ^016180736 + 0-006163416 


= 2-0946610 


The required root is 2-094561 correctly to seven significant figures. 
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When the rule of false position is written in the form 

m-m m ^ 

b^a ” 

we may express the rule as follows : * 

Assume two numbers as near the true root as possible^ and jmd the error 
arising from the substitution of each of these quantities instead of the 
unknown quantity in the proposed equation ; then as the difference between 
the two errors is to the difference of the assumed numbers so is either error to 
the correction of the corresponding assumed number. 

Assuming this new value a* instead of a and another quantity V 
differing from a* only by one unit in the last place so that V is greater 
or less than a* according as a' is found too small or too great, we then 
find a new approximation a' and so on to any required degree of 
accuracy. 

Ex, 2 .—Solve in this way the above equation 
0 ? — 2a; — 6 = 0. 


60. Gombination of the Methods of § 44 and § 49.—It 
was remarked by DandeUn f that by combining Newton's rule 
with the rule of false position we are in possession of a method 
of solving equations in which upper and lower bounds to the 
value of the root are obtained at every stage of the process, 
so that any digits common to the two bounds certainly belong 
to the correct value of the root. 

Thus using the figure of the last section, and still assuming 
that BD is the arc which is convex to the a;>axis, we draw as 
before the chord CD to cut the axis of x in the point A' 
whose abscissa a' is given by the equation 


of-a 

/(&)-/(«) ■ • ■' 


( 1 ) 


where a, h are the abscissae of the points C, D. Then a' is a 
closer approximation to the root of the equation than a. We 
now draw the tangent at I) to cut the axis of a; in B'. If h 
is the abscissa of D, then V, the abscissa of B', is given by 
Newtmi’s rule in the form 


V 


h- 


m 

f{h) • • •' 


( 2 ) 


and is a closer approximation to the root of the equation 

* Barlow’s MaihemtUieal Tables (1814X p. xxxvi. 
t Mim, de VAead,'Eoyale de Bruxelles 8 (1826), p. 30. 
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than h. It is evident that the root of the equation must lie 
between the bounds a' and V. 

Wo may now operate on o' and h' as on a and b, to find 
two new bounds of the root, namelj, 

a" — a'~ _ j'_) 

/(*')-/(«') ^ ^ fW)’ 

which envelop the root more closely than a' and V, and so on 
for further approximations to the root. 


It may be remarked here that at the conclusion of the process it is 
best to take, as our final value for the root of the equation, the arith¬ 
metical mean of the pair of values last calculated for a and 6. 

Sx ,—Consider again the equation 

/(«) = ai® - 2a; - 5 = 0, 

for which/(«) has a root between 2 and 2*1, while/'(a;) has no root 
between these limits. 


Now /(2) x/'(2)<0 and /(2-1) x/"(2-l)>0, 

so if a a= 2, 6 = 2*1, the curve y=f{x) is convex to the axis when a;=s 6. 
The next approximations are : 


b 


/(2.1)^ 0»061 
/{2-l)~ ^"“11-230 


= 2-09457, 


y(2)x_(2.1_-2J 
7(2.1)-/(2) 


Comparing V and a', we see that 2-094 are digits of the true root 
Now take «' = 2-0943, b' = 2-0946 


and we get 

f(a')(b'-a') 0-000000,841949,4579 

W)-W) ^ 0-003348,048729 


= 2-094651,476, 

J" = 6'-/(J')//'(6') = 2-0946 


0>000541,550536 
11.162047,48 


= 2.094551,483, • 

so the first eight digits of the root are 2.094551,4. 
The arithmetic mean between a" and b" is 
2.094551,48, 


which gives the first nine digits correctly. 
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51. Solutions of Equations by the use of the Oalcnlns 
of Differences. —We have already seen in § 34 how equations 
may be solved by inverse interpolation. We shall now show by 
an example how divided differences (Chap. 11.) may be applied 
for this purpose. 

Suppose it is required to solve the equation 
ysa? + - 12iB -10 = 0. 

The coefficients -10, -12, 3, 1, are of course the divided 
differences of y for the set of coincident arguments 0, 0, 0, 0, so 
we can write down part of a table of divided differences thus: 

X, y. 

0 



and this we shall now extend downward^ The third differ¬ 
ences of y are constant, and therefore if we take as the next 
argument cc = 2, we have 

*. y. 

0 - 

1 

0 - 3 

-12 1 
0-10 6 
-2 

2 -14 


Here the number 5, which is the new divided difference of the 
2nd order, is obtained from the equation (p — 3)/2 = l giving 
p = 6: then the new difference of the first order is obtained 
from (fi +12)/2 =>5 giving g= -2;'and lastly the value of the 
function curresponding to the argument a; = 2 is obtained from 
(r + 10)/2=-2 giving r=-14 In this way we construct 
the following table of divided differences, taking arguments 
suggested by the sequence of values of y already obtained: 
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3S. y. 


0 

-10 

-12 

3 

1 

0 

-10 

-2 

6 

1 

2 

-14 

16-75 

7-5 

1 

2-5 

- 6-625 

23-89 

10-2 

1 

2-7 

-0-847 

26-07 

10-9 

1 

2-7 

-0-847 

26-07 

11-1 

1 

2-7 

-0-847 

26-4039 

11-13 

1 

2-73 

- 0-054883 

26-7387 

11-16 

1 

2-73 

-0-054883 

26-7387 

11-19 

1 

2-73 

-0-054883 

26-761084 

11-192 

1 

2-732 

-0-0013608 


11-194 


Having found 

that 2-7 is 

near the root, 

since y 

is com* 

paratively small, we repeat the interpolation and thus obtain, 


(§ 16 ) 


y = - 0-847 + 26 07(»- 2-7) +11-1(*- 2-7)* nearly, 

giving cc - 2-7 = 0 03 approximately; so we take 2-73 as our next 
approximation. We continue this method until the approxima¬ 
tions are sufficiently accurate for our purpose. 

If we require the approximation correctly to four digits, we 
have a; = 2*732 and the n^xt digit is found to be 0, so the 
required root is 2*732. 

Having obtained this root, we proceed to find approximate 
values for the remaining roots of the equation. Thus we first 
transform y into a polynomial in (»- 2*732), 
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2-73 

- 0-054883 

26-761084 

11-192 

1 

2-732 

-0-001361 

26-783472 

11-194 

1 

2-732 

2-732 

-0-001361 

-0-001361 

26-783472 

11-196 


so 

y = 26-783(® - 2-732) + ll-196(x - 2-732)* + {x- 2-732)* (nearly). 

We may now divide out the factor x - 2-732 and solve the 
quadratic so formed, 

26-783 +11-196 (» - 2-732) + (® - 2-732)* = 0. 

The remaining roots are found to be 

/ x= -5-000 
t and » = - 0-732, 

which are correct to four significant figures. 

(The actual roots of the above equation are 2-7320508, 
-0-7320508, and -5.) 

Ex.—Find correctly to four significant figures the root of the 
equation 

x»-9®* + 23x-14 = 0, 
which is between -f 4 and + 5. 

52. The Method of Daniel Bemonlli.—In 1728 Daniel 
Bernoulli* devised a method wholly different in principle from 
any which were then known. Though hardly now of first-rate 
importance, it is interesting and worthy of mention. 

Let it be required to solve the equation 

. . .+a„ = 0 . ( 1 ) 

Consider the difference equation 

Ojy (< -m) + Ojy (< -t-« -1) + . . . -f (/) = 0. (2) 

The solution of (2) is known to be 

+ . . . ■¥WnX^, ... (3) 

where Wi, wg, . . ., w^ are arbitrary functions of ^ of period 
1 , and . . ., a;,, are the roots of equation ( 1 ). 

* Commentarii Acad, Se, Pdropol III. (1732); Of. Euler, ItUroduedo in Analy, 
1. cap. XYII.; Lagrange, Edi^ohUion des ^uatUms numiriqneB, Note 6. 
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If |a^| is greater than the modulus of any other root, the 
first term on the right in (3) becomes very large compared 'with 
the other terms when t is large; and therefore we have 

!+l) 


a^=Lt - 

<-^ao 


yif) 


This leads at once to Bernoulli’s rule, which is as follows: 

In order to find the absolutely greatest root of the equation (1), 
we take any arbitrary values for y(0), y(l), y(2), . . y[n - 1); 
from these by repeated application of equation (2) we ccdcidate in 
suecessionthevaluesofy{n)yy{n+l),y{n + 2), .... The ratio 
of two successive members of this sequence tends in general * to 
a limit, which is the absolutely greatest root of the equation (1). 

Ex, 1 .—Find the absolutely greatest root of 


flc® + 605* -6 = 0. 


Consider the difference equation 

2/(<4-6) + 6i/(« + 4)-6y(0 = 0, (1) 


and write down arbitrarily the values y(0) == 0,1/(1) = 0, y(2) = 0, i/(3) = 0, 
2 /( 4 )= 1. By means of equation (1), we have the following values of 
y(6), etc. : 


t 

6 

6 7 

8 

9 

10 

11 

12 

13 

y{t) 

-6 

25 - 125 

626 

-3,120 

15,676 

-77,760 

388,126 

- 1,937,600 


Also y(14)= -61/(13)+ 61 /( 9 ), 


so 


2/(14) 51/(9) 166 

2/(13)”" ^■^2^(13)“ 19376' 

= -4-991948. 


The absolutely greatest root of the equation is therefore given by this 
method as approximately - 4*991948 ; this value is as a matter of fact 
correct to the last digit. If we had stopped earlier, we might have 
obtained, e,g ,: 

= 4.99196, 

y(ll) -77,750 


which is in error only in the sixth significant digit. 


Ex, 2 .—Find the smallest root of 

»®-6y* + 9»-l=0 

correctly to seven places by Bernoulli's method, 

(Note ,—Put « = i and solve for x.) 

* If the ratio does not tend to a limit, but oscillates, the root of greatest 
modulus is one of a pair of conjugate complex roots. 
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53. The Bufflni-Horner Method.'*' — As we have men¬ 
tioned in § 41, the method of Vieta was the common method of 
solving algebraic equations until it was superseded by jNewton’s 
method of approximation. The reason why the Newtonian 
approximation was found to be less laborious than its pre¬ 
decessor was that the earlier method contained no provision for 
making the steps of one part of the process facilitate those 
which succeed. 

In 1819 W. G. Horner discovered a rulef for performing 
the computations necessary in Vieta’s method, by which that 
method was made preferable to the Newtonian, and so restored 
to favour. 

Horner’s contribution was essentially a convenient numerical 
process for computing the coefficients of the equation whose 
roots differ by a given constant from the roots of a given 
equation. We shall first consider this process. 

JjQt/ix) = 0 bo a given equation where/(a?) is a polynomial in 
X, and let it be required to find the equation whose roots are 
the roots of this equation, each diminished by r. This equation 
will be f{x + r)« 0 or 

0 -Ar) + xf'{T) + p"{r) + +. . . (1) 

The expressions/(r),/'(r),/"(?’)/2!,/"'(7’)/3!, . . . may now be 
found in the following way. Suppose for example that 

^a;) = Aafi + Ba?* + Ca? + Da^ + Ea: + F. 

Write down the coefficients A, B, 0, D,. . ., F in a horizontal 
row and form from them the following scheme in which a new 
letter below a line stands for the sum of the two immediately 
above it, e.g. P = Ar + B. 

* Ruffini, Sopra la thlermtnazione delle radici, Modena (1804); and Memorie 
di Mat, e di Fts, della Soc. Italiana delle Sdenze, Verona (1813). 

Horner, Phil, Trans, (1819), Part I. p. 30^,* and The Mathematician, 1 (1845), 
p. 109. 

t So far as the cube root is conoerned, it had been given previously by 
Alexander Ingram in the Appendix to his edition of Hutton's Arithmetic 
(Edinburgh, 1807). A method based on the same principles had been discovered 
in the thirteenth century by the Chinese mathematiciaiM. 
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B 

C 

D 

E 

F 

Ar 

Pr 

Qr 

Er 


■p 

Q 

R 

"S 

«> 

Ar 

Tr 

Ur 

Yr 


T 

U 

T 

X 


Ar 

Wr 

Xr 



W 

X 

> 



Ar 

Yr 




T 

</> 





Ar 

e 


It is seen at once that the values of 0, </>, ^^us 

obtained at the feet of the columns, have respectively the 
values 

0 = 5Ar + B = ^'-’"(r), 

<!> = 10 Ar* + 4Br + C = 3^"'W, 

= lOA^-s + 6Br2 + 3Cr + D = ^/"(r), 

X = 5 Ar* + 4 Bj'« + 3Cr* + 2Dr + E =/(r), 

0) = Ar® + + Cr® + Dr^ + Er + F = /(r), 

and therefore the equation (1) whose roots are the roots of the 
equation f{x) = 0, each diminished by r, becomes 

0 = + 6a^ + <fa^ + \l/x^ + x^ + f^- ( 2 ) 


The above scheme therefore enables us to find readily the 
equation whose roots are the roots of a given equation, each 
diminished by a given number. 

This process is first applied in order to diminish a root of 
the proposed equation by its first digit; then it is again applied 
in order to diminish the corresponding root of the resulting 
equation by its first digit, which is the second digit of the 
required root of the original equation; then again in diminishing 
the root of the equation last obtained by its first digit, which 
is the third digit of the required root; and so on. 
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quantity which would be given by the Newton-Raphson 
method as an improved approximation to the root, after r had 
been found as a first approximation. This property is used at 
each stage of the process in order to obtain the next digit of 
the root; in fact, we use the two last terms + «> of equation 
(2) to suggest the next digit. 

We shall apply this method io find the emallest positive root of the 
equation 


By means of a graph of the curves y = x®, y= 4x® - 6 it is readily seen 
that the required root lies between 1 and 2, so we shall first diminish 
the roots of the given equation by unity. 


1 

-4 

0 

6(1 


1 

-3 

-3 


-3 


2 


1 

-2 



-'2 

1 



The equation whose roots 

are the roots of the original equation diminished 

by 1 is therefore 

X® - x® ^ 

6x+ 2 

=0. (1) 


We now form the equation whose roots are ten times the roots of this 
equation ; it is 

> 1 Ox® - 600x + 2000 = 0. 

We want the root of this equation which lies between 1 and 10, the 
other roots being numerically greater than 10. It is found to be between 
3 and 4,* so we diminish the roots by 3, thus : 

-10 - 600 2000(3 

3 - 21 -1663 

- 7 -621 437 

3 - 12 

- 4 - 533 

_3 

- 1 

so the transformed equation is 

+ ( 2 ) 

* By the principles explained above, viz. that the two last terms of this 
equation have the chief influence in determining the root, the two first terms 
being evidently small compared with the two last terms, when x lies between 
1 and 10. 
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Multiplying the roots of this equation by 10, it becomes 
- 1 Osc* - 63300* + 437000 = 0. 

This equation has a root between 8 and 9 (as is seen by the in¬ 
spection of the last two terms since 437000/63300 = 8'19 . , .), so 


we diminish the roots by 8. 

1 -10 

- 63300 

437000(8 

8 

-16 

- 426628 

- 2 

-53316 

10472 

8 

48 


6 

- 53268 


8 

14 

The transformed equation is 

therefore 



*» -H4a:® - 632C8* -|-10472 = 0, (3) 

and, multiplying the roots by 10, it becomes 

a:3+ 140a2- 5326800a; + 10472000 = 0, (4) 


for which an approximate value of the root is 


10472000 

5326800 


or 1-9659. 


Thus, finally, we obtain for the required root of the original equation 
the value 

1-3819659, 

which is in error only by one unit in the seventh place of decimals. 

We may note that approximate values of the other two roots may 
be obtained from the equation at this stage in a very simple fashion. 
For if a cubic equation has two roots M and N, which are numerically 
very large in comparison with the third root c, the cubic is nearly 

a;8 - (M + N)a;2 + MNa; - MNc = 0. 

The other two roots of the cubic (4) will therefore be approximately 
the roots of the quadratic 

140a;-5326800 = 0, 

which are a:= 2239-05 

and a; =—2379-06, 

and therefore the two corresponding roots of the original cubic are these 
values divided by 1000 and increased by 1-38 (the part of the root 
already found), viz. 

x= -0-999 

and a;= 3-619. 


These are in error by one unit in the third place of decimals. 
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We may note that Homei^s Proem is really the formation of a table of 
divided differences. Thus taking the last example again, suppose it is 
required to find the equation whose roots are the roots of 

X = 05® - 4a® + 5 = 0, 

each diminished by unity. We note that the coefficients 6, 0, - 4, 1 are 
the divided differences of X for the set of coincident arguments 0, 0, 0, 0, 
so we can write down the following table of divided differences: 

a X 


0 6 
0 6 
1 2 
1 2 
1 2 


0 


-3 


-5 


-6 


-4 

-3 


-1 


1 

1 

1 

1 


whence, by Newton’s formula for interpolation with repeated arguments 
(§ 16) we have 

X = 2-6(a-l)-(a-l)2 + (a-l)® 
or Xs 2 - 6 y- 2 /® + 2 /®, 


which is equation (1) above. 

Therefore the equation whose roots are ten times the roots of the 
reduced equation is 

X = 2000 - 500a - 10a® + a® = 0. 


Now diminish the roots by 3. This is done by the difference table 
X X 


0 

2000 


-10 



— 

600 


1 

0 

2000 


-7 



— 

521 


1 

3 

437 


-4 



— 

533 


1 

3 

437 


-1 




533 



3 

437 




Xs 

437-533(a- 

.3)-(a 

:-3)® + (a- 

3)®, 


so that 

which gives us precisely equation (2) above; and so on. 

Mx, 1 .—Find to six significant digits the podtive root of the equation 
a® + 3a®-12a-10 = 0 

hy performing three HomePs transformations and then approximating. 
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First, diminish the roots by 2, 


3 

-12 

-10 

2 

10 

- 4 

5 

- 2 

-14 

2 

14 


7 

12 



~9 

Therefore the new equation is 

ic3+9a;2+12a;-14 = 0. 
Multiijly the roots by 10, 

+ 90*2 + 1200a; - 14000 = 0. 


Diminish the roots by 7, 


1 90 

1200 -14000 

7 

679 13153 

97 

1879 - 847 

7 

728 

104 

2607 

7 


111 


Therefore the new equation is 


:e®+llla:2+ 2607*- 847 = 0. 

Multiply the roots by 10, 


!C* + 1110** + 260700* - 847000 = 0. 

Diminish the roots by 3, 


1 1110 

260700 - 847000 

3 

3339 792117 

1113 

264039 - 54883 

3 

3348 

1116 

267387 


3 

1119 


Therefore the new equation is 

a;»+1119a;2+'267387a;-64883 = 0. 

Moreover, 0*206 (nearly), 

267387 


so that the required root is 2*73205. 
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To get a better approximation, we have from the last equation, 
approximately, 

54883 1119 , 

*“267387 267387®^ 

= 0-206267 - (0-0041849)(0-206)* 

= 0-205257-0-000176 
= 0-206081, 

80 the required root is 2*732050,8. 

Ex. 2. —Find correctly to seven significant digits that root of the equation 
jc®-9a;2 + 23a;-14 = 0 

which is between +4 and +5, performing three Homer^s transforma^ 
tions and then approximating. 

Ex. 3.— Find by Horner^s method a root of the equation * 

a?-2x=5. 

54 The Root-squaring Method of Dandelin, Loba¬ 
chevsky, and Qraeffe. —^We shall next consider a method 
of solving equations which was suggested independently by 
Dandelin t in 1826, Lobachevsky J in 1834, and Graeffe § in 
1837, and which is frequently of great use, especially in the 
case of equations possessing complex roots. It has the 
advantage (when performed completely) of finding all the 
roots at once and of not requiring any preliminary deter¬ 
mination of their approximate position. Its principle is to 
form a new equation whose roots are some high power of the 
roots of the given equation: suppose we say the 128th power, 
so that if the roots of the given equation are Xi, Xz, Xz, . - ., 
then the roots of the new equation are x^^, x-^^, Xs^^, . . . 
These numbers are widely separated ; thus if Xi were t\yioe 
then would be more than 10®® times aj2^®®, and, as we shall 
see, an equation whose roots are very widely separated can be 
solved at once numerically. 

* A pupil of De Morgan by Homer’s method found the root of Wallis’s well* 
known example oi? ~ 2jc= 5 to 51 places to be 

a:=2*094 551, 481 542, 326 591, 482 386, 540 579, 302 963, 857 306, 105 628 239. 

This was subsequently extended to 101 decimal places; cf. The Mathematician^ 
8 (1850), p. 290. 

fMim. de VAead. Boyale de Bruxdles^ 8 (1826), p. 48. 

X Algebra or Caleulus of Finites, Kas&n (1834), § 257. 

%Avfi6sung der h&heren numerisehen Oleiekungen, Zurich (1837). The method 
was perfected by Brodetsky and Smeal, Proe. Oanib, PhU. 8oe.t 22 (1924), p. 83. 
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Suppose it is required to solve the equation 

+ + ^ . + an = 0. (1) 

In this equation we shall denote the roots, which will for the 
present be assumed to be real and unequal, by -a, -B, -c, 
- d, . . the order being that of descending numerical magni¬ 
tude, so that |al>|B|>|c|>|d| . . . The values a, B, c, d, . . 
which are the roots of the equation reversed in sign, will be 
called the Enche * roots. Equation (1) may now be written in 
the form 

a5”+[a]a?”‘i + [aB]a;"‘® + [a&c]a;"“® + . . . = 0, (2) 

where [a] denotes a + B + c + (Z + . . that is, the sum of the 
Encke roots, [aB] denotes aB + ac + Be +. . the sum of the 
products of the Encke roots taken two at a time, and so on. 
If, moreover, we denote the sum + B”® + +... by [a"‘], 

then the equation whose roots are the mth powers of the roots 
of the given equation will evidently, if m is even, be 

+ . . = 0. (3) 

The problem before us is to construct the equation (3) when 
the equation (2) is given, and m is some prescribed number. 
In practice m is a large number in the equation (3) which is 
ultimately formed, but we do not attempt to construct this 
equation at a single step of the process; instead of this we first 
take m=2, that is to say, we form a new equation whose 
Encke roots are the squares of the Encke roots of the original 
equation; then, having done this, we repeat the process, 
forming a new equation whose Encke roots are the squares of 
the Encke roots of the equation just obtained—that is, the 4th 
powers of the Encke roots of the original equation—and so on. 

Thus our immediate problem is to construct the equation 
(3) when equation (2) is given and m has the value 2. This 
we do in the following way. Rearrange the given equation 

+ + + . . .4-an = 0, (1) 

SO that the terms containing the even powers of x are on one 
side of the equation, and the terms containing the odd powers 

• Encke, Journal fUr Math, 22 (1841), p. 193. 
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are on the other side. Squaring both sides of the resulting 
equation, we have at once 

or, putting= 

+ (^1® ” 2(42)^” “ ^ + {tt2^ - 2aia3 + 2a^y^ =0. (4) 


Since the roots of equation (1) are - a, - 6, - c,. . and 
- a?* =*y, the roots of equation (4) are - a®, - ft®, - c®, . . .; that 
is to say (writing x in place of y), the equation whose Encke roots 
are the squares of the Encke roots of (1) is 


sc*»+ai2 

-2oa/ 



+ 04 * 

— 2413(15 + 2(t2Glg 
-2aya>j 
+ 2ag 




+ . 


. .= 0 . 


(5) 


The law of formation of the coefficients in equation (5) may 
be stated thus: The coefficient of any •power of x is formed by 
adding to the square of the corresponding coefficient in the 
original equation the doubled product of every pair of coefficients 
which stand equally far from it on either side, these products 
being taken with signs alternately negative and positive. 

Haying thus formed the equation whose Encke roots are the 
squares of the Encke roots of (1), we repeat the process, thus 
obtaining an equation whose Encke roots are the squares of 
the Encke roots of (5), that is to say, the 4th powers of the 
Encke roots of (1). The next stage yields an equation whose 
Encke roots are the 8th powers of the Encke roots of (1), and 
BO on. 

Now consider the equation which is obtained when the 
process has been repeated several times, so that we have calcu¬ 
lated the equation (3) where m is (say) 64 or 128 or 256. 
Since a is numerically larger than J, therefore is enormously 
larger than 6”^ or c^^ ox . ., and thus the sum [a*"] bears 
to its first term a*** a ratio which is very near to unity. Simi¬ 
larly [a”^b^] bears to its first term a ratio which is very 
near to unity, and so on. 

If, then, the ratio of [a’^] to a’** is 1 + c, where € is small, we 
have 


log [a*"] « m log ]al + log (1 + c) 



ALGEBRAIC & TRANSCENDENTAL EQUATIONS 109 
or logM = ^log[a>»]-ilog(l+e). 

Since we have ealcukted [a«*], the right-hand side of this 
equation is known except for the small quantity — log (1 + e), 

which we may neglect; and thus [a] is determined. 

Next, since =o”‘6"*(l +y), where y is small, we have 

log I®^1 = ^ log neglecting ^ log (1 + y), 

and therefore 

log 1^1 = ^ log [«’"&“] - “ kg [o™], 

which determines the modulus of the second root b, and so on. 

In solving an equation by this method, it is all-important 
to know when to stop. Obviously tlie time to stop is when 
another doubling of m would give a result not different (in the 
digits we wish to include) from the result which would be 
obtained by stopping at once; that is to say, when the 
coefficients ... of the new equation are 

practically nothing but the squares of the corresponding 
coefficients [a’^], ... in the equation already obtained. 

Eoc. 1.— To solve the equation 

jc® + 9 £c* + 23« +14 = 0. 

The equation whose Encke roots are the squares of the Encke roots 
of this equation is 

01® + 35x2 + 2'77;<; +196 = 0. 

The equation whose Encke roots are the squares of these is 
a;® + 67 lx® + 63009x + 38416 = 0, 
and proceeding in this way we obtain the following equations: 

X®+ 324223x®+3-91858 X 10»x + 1-475789 x 10® = 0. 
x® + 9-72834x 10%®+1-535431 x 10%+2* 177953 X 10i® = 0. 
x® +9-43335 X 10®ix®+ 2-357548 x 10®®x +(2-177963 x 10i«)® = 0. 
aj? + 8-898762 x 10%® + (2-357648 x 10®®)®x + (2-177953 x 10^®)^ = 0. 
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Stopping at this stage, we have 

log {a^) = log (8-898762 X 10«) = 43*9493296, 

whence log |a| = 0*6867083, 

so the numerically greatest root i8± 4*860806. 

Also log log (2-367648 x 10«») 

= 76*7449212, 

BO log |6l=^if(76*7449212-43*9493296)=:0*6124311 

= log 3*2541016. 

The next root is therefore +3-2641016. 

Lastly, 

log a«^6«^c«4=:4 log (2-177953 x lO^*), 

so log |cl = ^\^(73*3521936-76*7449212)= 1*9469886 
= log 0*8860924. 

The numerically smallest root is therefore ± 0*8850924. 

A rough graph shows that all the roots are negative; they are 
therefore 

-4-860806, -3-2641016, and -0*8860924. 

We do not usually require to compute each of the roots of an 
equation to the same degree of accuracy, although we may require 
to know the approximate values of all the roots. We shall 
therefore give here a more rapid way of computing the roots to 
two or three significant figures, in which we use a 4-place table 
of squares, such as Barlow’s Tables, and 4-place tables of 
logarithms. We shall denote by the equation whose roots are 
minus the mth powers of the roots of the original equation, and 
we shall write, e.g,, 1896*^ for the number 1*896 x 10®. 

Ex. 2 .—To solve by this method the equation of Ex. 1; 

a:^ + 9a;*+23x+14 = 0. 

We first arrange the coefficients of the equation in order : 

a;^. a^. x. sfi. 

1 9000 2300^ 14001 

Squaring these coefficients and adding the doubled products as in 
equation (5), we compute then p^, p^, and so on, arranging the values 
of the successive equations in tabular form, thus: 
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1 

81001 

-4600 

6290® 

- 2520 

1960* 

p* 

1 

36001 

2770* 

1960* 


1 

1225* 

-0554 

7673* 

-1372 

3842* 

p* 

1 

0671* 

6301* 

3842* 


1 

4602* 

-1260 

3970* 

- 0052 

1476* 

pS 

1 

3242* 

3918* 

1476* 


1 

105111 

-0078 

16351* 

21791* 

yM 

1 

097311 

16361* 

21791* 


1 

9467*1 

-0031 

2356** 

4748** 

p32 

1 

9436*1 

2366** 

474836 

pM 

1 

8904** 

6561’* 

2264’* 


Examining the coefficients of we see that if the coefficients of 
pi28 ^ere formed they would be (to four significant digits) the squares of 
the coefficients of so we stop the process here and coinijute the roots 
of the equation as in Ex. 1, using, how'ever, 4-place tables. We have 

log = 43-9496, 

whence a = + 4-861. 

In Older to determine the sign of the root a, we note that /(— 4*8) 
= -I- 0-368 and/( -49)= - 0*259, where f{x) = ac® + 9ic® + 233; +14. The 
equation has therefore a root between — 4-8 and — 4*9, and we have 

a= -4*861. 

Also log = 76*7444; subtracting log a®®, we find log | &|=log 3*264, 

whence 

h= -3-254. 

Again, log a®*6®®c®® = 73*3630 ; subtracting the value of log ft®®6®®, we 
find log |c| = log 0*8861, whence 

c= -0*8851. 

The required roots correctly to four significant figures are 
-4*861, -3*264, -0*8861. 
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Ex. 3. —Apply the root-sq^iaring method to approximate to the roots of 
the equation 

a;3 + l0a;2+31a; + 30 = 0 

to three significant digits. 

Ex. 4 .—Find correctly to seven digits the roots of the equation 
a;®+12x2 +44a;+ 48*0. 

56. Oompaxison of the Boot-squaring Method with Bernoulli’s 
Method. —The root-squaring method may be regarded as belonging to 
the same general type as that of Bernoulli, with which it may be connected 
in the following way. 

Let the equation whose roots are to be determined be 

x»‘ + aiX"-l-f-a2X»“2 + . . . + a;i=(). (1) 

Bernoulli’s principle (§ 52) is that, if we select arbitrarily any values for 
y(0), 1 /( 1 ), . . ., i/(ii - 1), and then determine i/(n), i/(?i+ 1), etc., in succes¬ 
sion by the equation 

y(« + n) + etiy(« + ii-!) + • . . + arty(0*=O, (2) 

then the ratio of two successive members of the sequence of i/’s tends in 
general to a limit, which is the absolutely greatest root of the equation. 
Now let Sp denote the sum of the ptli powers of the roots of equation 

(1) , and take 2 /( 0 ) = So, i/(l) = Si, . . ., - 1) ^ Since each root 

of equation (1) satisfies the equation 

. . + a,ia;^ = 0, (3) 

it is evident that if we form n equations by substituting in this equation 
the different roots in succession, and then add these n equations, we have 
at once 

BO that Sfy considered as a function of t, satisfies the difference equation 

(2) . We see therefore that with the values we have chosen for the first 
n of the y% y{t + n) will be equal to St^ui Bernoulli’s formula yields 
the result that the absolutely gi-eatest root of the equation (1) is 

Lt 

*->OC ** 

This is obviously closely related to the result of the root-squaring method, 
by which the root in question is given in the form lit 

* The value of the numerically greatest root of an equation in the form 
where p, y, . , . are the roots, and a>j 8 > 7 , had 
been given as early as 1776 by Waring in his Meditationes Atmlyticae, p. 311. 
Euler used the method to find the roots of Jo(x) in 1781. 
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56. Application of the Boot-squaring Method to deter¬ 
mine the Complex Boots of an Equation.— If the roots of an 
equation are not real the method of § 54 is equally applicable. 
Suppose, for example, that the equation is of the 5th degree, 
the roots in Encke’s sense being a, 6, c, where a, 5, c 

are real and |a[>r>|6|>|c|, so that the equation is 

{x + a)(x + re^^){x + re-^^)[x + b){x + c) 0. 

The equation whose Encke roots are the mth powers of these is 
{x + + h‘^){x + c”') = 0, 

and if m is a large number, and we retain only the dominant 
part of the coefficient of each power of a?, this reduces to 

a? + + 2a”*r”*' cos 

+ a’= 0. (1) 

The root a may be computed at once by the method already 
given in § 54. We now proceed to find the other roots. 

It is evident from equation (1) that, corresponding to a 
pair of complex roots, there will be one coefficient, namely, 
cos which fluctuates in sign when m takes in succes¬ 
sion a set of increasing values (owing to the presence of the 
cosine factor). We also see that the value of corresponding 
to the pair of complex roots may be computed from the 
coefficients in the terms immediately preceding and immedi¬ 
ately following the fluctuating term: in the above equation 
Having found the value of we now find 
the values of the Encke roots 6, c from the two last coefficients 
in equation (1). Let a', be the actual roots {i.e. roots with 
their proper sign) corresponding to the Encke roots a, 6, c. 

In order to compute the complex roots re'^'y we write 
them in the form u + iv and u-iVy where u-r cos = r sin 
so that re^ + = 2u. If the original equation is written 

sc® + ajSC® + aaSC® + + a/i^X + as = 0, 

we see at once that the sum of the roots satisfies the relation 
0 L\ = "J" 2u + 5 •i' c y 

from which u may be found. Since u, r® are known, v may 

(D311) 5 
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now be found from the relation r* = + /y 2 jhe two complex 

roots are thus determined. 


Eq^ —1. Solve the equation 




as*- 

2ic® - 6ic* + 25a; 

-28 = 0. 



a?*. 

1 

aP. 

-2000 

aP, 

-6000 

X. 

2600* 

sfi. 

- 2800* 


1 

4000 

12000 

36001 

10000 

-6600 

6260* 

-3360 

7840* 

pa 

1 

16001 

8000* 

2890* 

7840* 


1 

2660* 

-1600 

6400® 

-9248 

1568 

8352* 

- 12644 

6147® 

p4 

1 

0960* 

-1280® 

-4192* 

6147® 


1 

9216* 

2560 

1638« 

8049 

1229 

1767® 

1674 

3779** 

p8 

1 

1178* 

1092* 

3331® 

3779** 


1 

1388* 

-0218 

1192“ 

-0786 

0008 

1110“ 

-0826 

1428*® 

pie 

1 

1170® 

0416“ 

0285*® 

1428*® 


1 

1369“ 

-0008 

1722** 

-0667 

8123*® 

- 11862 

2039*® 

p32 

1 

1361“ 

1065** 

- 3729»® 

2039*® 


1 

1862** 

1113“ 

0102 

1391*® 

-4302 

4168®* 

p64 

1 

1862®* 

1216“ 

-2911*® 

4168®* 


1 

3430®* 

1476*®® 

0011 

8474**® 

-10104 

1729*®® 

pl28 

1 

3430** 

1487*®® 

[-2630**®] 

1729*®® 


It is evident that further equations are obtained by simply squaring 
the corresponding coefficients of the previous equation with the exception 
of the coefficient of x (which fluctuates in sign, thus indicating the 
presence of a pair of complex roots). The process of forming new 
equations may therefore be stopped here. 
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Taking logs, wo obtain tlic results 

loga^®®= 64-5363, whence log ja =0-5042 and a = 3-193(-). 

loga^®®6^®®= 108-1724, whence log \h =0-3409 and 6 = 2-193( + ). 
log .266 _ 185 - 2377 ^ whence log r® = 0-6021 and r® = 4-000. 

It may be remarked here that the most important niiinbers in 
determining the values of the quantities a, 6, r® are the indices 
representing the powers of ten in the factor of the coefficients in the 
final equation. For example, in the equation denoted by we obtain 
practically the same value of the roots a, 5, if we neglect the last two 
digits of the numbers 3430 and 1487; but an error in the indices 
64, 128 would give quite a different result. Since the above numbers 
cannot be relied upon as regards the accuracy of the fourth digit, we 
shall consider the final values of the roots to be correct only to three 
significant figures. 

In order to compute the values of the complex roots u ± iv, we see at 
once, from the coefficient of ic® in the original equation, that 

2m-3-193+ 2-193 = 2, 

whence 1-600, i?= ± ± 1-3228, 

and the complex roots arc 1-500 ± l-323i. 

Thus the required roots of the equation correctly to three signifi¬ 
cant digits are —3-19, 2-19, l-50±l-32i^. (As a matter of fact the 
above values of the roots are correct to four significant digits, the roots 
correctly to six places being - 3-192582, 2-192582, 1-5 ± l-322876i.) 

Ex, 2. —Find the roots of the equation 

- 6a;® + 1 lac® + 2a; — 28 = 0. 

57. Equations with more than one pair of Complex 
Boots. —The case where the given equation possesses two 
pairs of complex roots presents little additional difQculty. 
Suppose the process is applied to the equation 

®»++ aa*“"* +. . . + a„_l® + a„ = 0, (1) 

and that the coefficients of two of the powers of x are found to 
fluctuate in sign, thus indicating the presence of two pairs of 
complex roots. Denote these complex roots by re**, re"**, r'e*^, 
r'e"**. We shall assume that the values r®, r'®, together with 
the actual real roots a', b', e', have been computed as in the 
last section. 

Writing u=r cos 9, v=r sin 9, cos 0, v' = r' sin ^ the 
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complex roots are denoted by u±iv, and u'±iv\ and their sum 
is 2(/6 + u^). Writing equation (1) in the form 

(a?-a')(a?- V){x - c'). . . (a?- 2ux + r'^)(x ^- 2u*x + r'^) = 0, (2) 

we see that the coefficient of x in this equation is a linear 
function of u, v/ which we shall call u'). 

Then u') = -i. (3) 

By comparing the coefficients of a5"“^ in equations (1) and (2), 
we find 

— ttj == Qt* + &^ + c +. • • + 2w + 2vt» (4) 

The two last equations express the unknowns u, v! in 
terms of the known quantities a\ h\ c', . . r'* and may now 

be solved for v !; the values of i?, v' being then determined 
from the equations The two pairs of 

complex roots are thus known. It is evident that a similar 
method to that given here may be applied to solve equations 
with more than two pairs of complex roots. 

Ex.—To solve the equation 

a’+ 7ii* + 2l!e® + e3a!« + 147®*+189*® + 119® - 451 = 0. 

As ill Ex. 1, § 56, wo first form the equations for y®, p®, as 

follows: 


1>* 

1 

7000 

»•. 

-1470* 

x*. 

0203* 

*•. 

0911* 

X*. 

-6610* 

w. 

1846* 

2034* 


1 

3430* 

3699* 

1666* 

4865’ 

-0133* 

6690“ 

4137“ 


1 

0436* 

0391* 

-1366“ 

6964“ 

-6389“ 

3249** 

1711“ 


1 

1119* 

2860” 

-4086“ 

3644** 

-1621" 

1068" 

2928" 

pit 

1 

0682“ 

9044“ 

-0411" 

1201" 

-6083“ 

1119" 

8573“ 

PM 

1 

2842“ 

8173" 

-2004" 

1438“* 

-0104“* 

1262”* 

7360*" 

pin 

1 

6442“ 

6680“* 

1665“* 

2068*" 

-3690"* 

1668*“ 

6402*" 

p« 

1 

4016“* 

4462*»* 

[0009*"] 

4277*“ 

[0640*"] 

2469*" 

2918*" 


The coefficients of ®® and a® fluctuate in sign, whence we infer the 
presence of two pairs of complex roots. Taking logs, we have 


256 log a= 141'6037, whence 
256 log 05=279-6496, 

256 log a5r® = 504-6311, 

S66 log a&r®r'® = 688-3908, 

266 logo6c®r®r'® = 679-4661, . 


log la| = 0-5531 and «'= -3-674. 
log 15| = 0-6392 6' = - 3-461. 

1<^ t® = 0-8788 r®= 7-666. 

log r'* = 0-7178 r'®= 6-222. 

log |c| = 1-9652 e'= 0-923. 
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In order to compute the values of we write down the value of 
the coefficient of a;® in the original equation, 

- 7 = - 3-574 - 3-461 + 0-923 + 2w + 2w', (1) 

and then the value of the coefficient of 

119 = + b'c* -f c^a') -f 2a'b*c\ur'^ + u'r^). (2) 

Substituting in these equations the known values of a\ b^ c\ r*, 
r'*, we have 

14 = 0-681, 14'= - 1-126, 

whence u = 2-665, i;'= 1-989. 

The required roots of the given equation are 

- 3-574, - 3-461, 0-681 ± 2-666t, - 1-125 ± 1-989^ 0-923, 

in which values the last digit is uncertain. 

[The roots correctly to four significant digits are -3-578, -3-468, 
0-684 ± 2-6641, - 1-128 ± l-987i, 0-923.] 

58. The Solution of Equations with Coincident Roots 
by the Root-squaring Method.—When the root-squaring 
process is applied to an equation possessing coincident roots, it 
does nob lead ultimately to an equation in which every co¬ 
efficient is the square of the corresponding coefficient in the 
preceding equation. We shall now consider in what way the 
process may be applied to solve such an equation. 

Suppose that the Encke roots of the given equation f{x) = 0 
are a, b, V, Cy d, . , , where h, V are coincident roots, and 
where |a| > {6|> |c|>|4i| . . . The equation, whose Encke roots 
are the mth powers of those of the given equation, will be 
denoted by 

+ + . . = 0, (1) 

and, retaining only the dominant term in each coefficient, for 
large values of m this equation reduces to 
5^ + +... = 0. (2) 

We see that the coefficient of does not follow the usual 
rule, viz. that when m is doubled the coefficient is approximately 
squared. Here, on the other hand, when m is doubled, the new 
coefficient is approximately half the square of the old one. 
This observation enables us to detect the presence of a repeated 
root. It is evident that the value of b may be computed in 
much the same way as r* in the case of complex roots, namely, 
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by forming the quotient of the ooeffioienta immediately follow¬ 
ing and immediately preceding the irregular coefficient. Thus 
from equation (2) we have 

the remaining roots, a,e,d,... being computed as before by 
the method of § 54. 

Ex, 1 .—Solve by the root-squaring method the equation 

a;»+7a;2+16a;+12«0. 

The successive equations are 

+17ic2 + 88a;+144 ^ 0. 
y +113a;2 4- 2848aj 4- 20736 = 0. 

f ics 4 -7073a;2 4-3*425 X 10«a-4-4-300x108 = 0. 

4-318 X 107a;2 4-6*648 x 10i2a;4-1-849 x 10i^ = 0. 
jp 82 1.853 X 10i6a;2 4-1-593 x 1026a;4-3-419 x 1034 = 0. 

It is evident that the coefiicients of the equation would be the squares 
of the corresponding coefficients of except in the case of the coefficient 
of X, We therefore denote the Encke roots of the equation by a, 6, 6', 
where 6, 6' are approximately equal, and where |a| > |6|. Taking logs, 
we have 


log a®2= 16-2679, whence log |a| = 0-4771 and |a| = 3-000, 
log = 34-6339, whence log 16 =0-3010 and 16 =2-000 

= | 5 '|. 

The actual roots of the given equation are - 3, - 2, — 2. 

Ex. 2 .—Calculate by the root-squaring method the greatest root of the 
equation 

os* - 7a? 4- 7a;8 - 7a; 4- 7 =s 0. 

Use the result to prove that the other three roots are nearly equal in modulus^ 
but that only one of them is real 

59. Extension of the Root-sqnaring Method to the Roots 
of Functions given as Infinite Series. —The root-squaring 
process depends essentially on the fact that the coefficients of a 
polynomial (in which the coefficient of the highest power of sb is 
unity) are t^ elementary symmetric functions of the roots; or, 
if the polynomial is divided throughout by the term independent 
of X, BO that the term independent of x becomes unity, the 
coefficients are the elementary symmetric functions of the 
reciprocals of the roots. This, however, is true not merdy for 
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polynomials, bat for all entire transcendental functions of genre 
zero,* and hence we can readily see that the root-squaring 
process is applicable to any entire transcendental function of 
genre zero; and, indeed, as has been shown by Polya,f the 
method may be used in connection with functions of any finite 
genre. 

Ex. 1.— To find by the root-eguaring method the loioeet roote of the Fourier- 
Beml function. 

The Fourier-Bcssel function may be defined by the power-series :|: 

gi4 ^6 

Jo(*) = 1 ~ p + pja ~ 2*4*. 6* ■ ■ ' 

35^ 35® SC® X® 35^ SC® 

where x= — Evaluating the terms on the right side of this equa¬ 
tion, and retaining eleven places of decimals, we liave 

Jo(«)=l+a5 + 0-263;2 +0-027777,777783;® 

+ 0-001736,111113;^ 

+ 0-000069,444443;® 

+ 0-000001,929013;® 

+ 0*000000,039373;7 
+ 0-000000,000623;®. 

The equation whose roots (in Encke^s sense) are the squares of the 
roots of this equation is 

0 = 1 + 0-5a; + 0-010416,666663;® + 0-000038,680243;® 

+ 0-000000,043063^ + . .. 

'J'herefore the equation whose roots are the 4th powers of the roots 
of the original equation is 

0=1+ 0-229166,666673;+ 0-000070,012833;®+ 0-000000,000603? + . .. 
and the equation whose roots are the 8th powers of those roots is 


0= 1 + 0-062377,336173; + 0-000000,0046263?. 
Evidently we can take 


3?a= 


1 

0-052377 


19-0923, 


whence — 1-4468 and is = 2 V —a? = ±2*4048. 


♦ On the subject of genre cf. Borel, Lemons sur lee fonctions erUiires 
(Paris, 1900). 

t ZeUechrift fUr Math. 63 (1915), p. 275. 

* X Whittaker and Watson, Modem Analysis, § 17.1. 
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i: 2 U 


Therefore the lowest root of Jq(«) is a = 2-4048. 

Eqg, 2 .—Calculate tt from the property that 7 r /2 is the lowest root of the 
series for cos x, i,e. 


0-1-^ + ^* 
2!^4I 


sc® sc* 

61 ■^81 


60. A Series Formula for the Root. —A method published 
in 1918 * is somewhat different in character from those which 
have been described hitherto, since it furnishes a literal formula 
by mere substitution in which the root is obtained. I'he 
result may be stated thus: 

The root of the equation 

0 = «o + ^ia5 + a2a5* + flj3a5® + a4aJ® + . , . (1) 

which ie the smallest in absolute value, is given hy the series 



As a numerical example, consider the ecpiation 
4a;*-321a;+ 20*0. 


Here aQ* 20 , — 321, — 4, 1 . 

I a^ a^ * 103121, (ij (I 2 ^3 “ “* 3312*7121, a^ <Zg I * 337* 

I ^ ^2 I 

0 OoOi 

The smalleBt root of the equation is therefore 

20 _ 20* X 4 20® X 337 

321 321 X 103121 103121 X 33127121 ^ ‘ 

or 0-062306,30 - 0-000048,36 + 0-000000,79, 

or 0-062257,73 correctly to seven decimal places. 

The series converges rapidly when the ratio of the smallest 
root to every one of the other roots is small. In calculating 

* Whittaker, Proe. Edi», Math. Sot. S6 (1918), p. 103. Cf De Morgan, 
>/barn. Iwt. Aet. 14 (1868), p. 358. 
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any root of a given equation by the formula, it is therefore 
advisable in many cases first to transform the given equation 
by two or three root-squaring operations each of which re¬ 
places the equation operated on by an equation whose roots 
are the squares of its roots; or else, if an approximate value of 
the root is known, by Horner’s process to reduce the roots of 
the given equation by a, where a is an approximate value of 
the required root, so that the required root of the new 
equation is small compared with any of the other roots. 

Proof of the Formula .—Let the roots of equation (1), supposed 
for the present to be of degree n, be aji, X 2 y . . ., aJh* Then if z 
be any number whose modulus is smaller than each of the 
moduli of the roots, we have 

_Oo__^_1_ 

+ + + - . . . {l-zjXn) 

= (1 + zjXi + Z^jxT? + . . . ) (1 + zjXi + Z^/Xz^ + 

{i-\-zlXn + Z^IXn^ + . . . ) 

= 1 + ViZ + V^z^ + + . . . 


where P,. denotes the sum of the liomogeneous powers and 
products of the reciprocals of the roots taken r at a time. 
Therefore 

Oo = (oo H aiz + a2Z^ + • • •) (^ *^ + ...)• 

Equating coefficients of powers of z, we have 

0 == + ttoPi, 

0 = ^2 + ttiPi + CJoP2» 

0 = Os h a2Pi + + ®oP3, 


whence * 


■p ^ "P ^ ^ 

^1= ““i 

do dff 


di do 

1 

di do 0 

, P8=---3 

Oo® 

d2 dx do 

d2 d\ 

do d2 d\ 


, etc. (3) 


Now since 


a\ (I 2 
do 


= Oi* - Oofla, we see that the first two terms 


* The formulae of equation (3) were known to Wronski, Tntrod. i la philos. 
dea math. (1811), Paris. 

(d311^ 5« 
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of the series (2) are equivalent to the single term 


Oj\ 6^2 



( 4 ) 


Moreover, by 
jugate we have 


Jacobi’s theorem on the minors of the ad- 


dx d% d% 



2 


O/Q dx d% 

= 

dx d% 

— /» 2 

d% dz 

0 Oq 


Oo dx 

- Oo 

dx d% 


and this shows that the term (4), together with the third term 
of the series (2), is equal to 



Cli (^2 

Op ax 
dx ^ ^ 
CLq Oix (I2 
0 Oq ai 



Again, by Jacobi’s theorem we have 


( 5 ) 


dx d% 


dx d%dzd^ 


dx d% dz 

®-Oo* 

dzdzdi 

dfx dx 


dz dx dz dz 


dQ dx dz 


dx dzdz 


0 dz dx dz 


0 do dx 


do dxdz 


0 0 Oo 






and this shows that the term (5), together with the fourth 
term of the series (2), is equal to 


dxd^Qi^ 
CLq (iidz 
0 Oq Ol 
Ctrl d/% 0/% 
df^ dx (1% d% 
0 Oo Oi Og 

0 0 Oo oq 


or 




which is therefore equal to the sum of the first four terms of 
the series (2). Proceeding in this way, we see that the sum 
of the first a terms of the series is equal to P,_i/P^ 

If now for simplicity we consider the case when n»>2, so 
that there are only two roots, and sb*, of which we shall 
suppose Xi to have the smaller modulus, we have 
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J_ 1 1 1 


»2' 




1 

-^ + 


+ ---- 


Xi* 


1 

‘^»2* 


1 + 


Xj «.*-l 


®1 

- 4- + - - 


= - O 1 . 

- Xi x^ a?!* 

1 + — + “4 +. . . + -Arn + 

a?2* 


and since 


X2 Xi 

a;2 aj2* x^ 

< 1, this gives at once 

Lt -p~=a?i. 

<^00 I g 


.*“1 




Similar reasoning leads to the same result when n>2. 

Thus the sum of the first s terms of the series is equal to 
‘A.s s increases indefinitely this tends to the limit X\, 
where Xi is that root of equation (1), which has the smallest 
modulus. Thus the theorem is established. 


Ex, 1. —Find the root, which is smallest in absolute value, of the equation 
+ 14a;2.53268a; + 10472 = 0. 

Ex, 2.— Find the numerically smallest root of the equation 
- 26a;^ + 200a;3 - GOOa;* + 1200a; - 500 = 0. 

61. General Remarks on the Different Methods. —The 

process of calculating a root of an equation /(a;) = 0 may be 
regarded as consisting in general of three stages: 

I. Locating roughly the position of the root, — This is, in 
general, best done by plotting a rough graph; in the case when 
f[x) is a polynomial, it may also be done by divided differences, 
or by the rules of Descartes, Fourier, and Sturm, which are 
explained in works on the algebraical theory of equations. 

II. Transforming the equation so as to isolate this root from 

the other roots of the equations ,—I..et Xi be the required root of 
the equation f{x) = 0, and let x^, Xs,,., he the other roots. 
Then, performing a transformation x = <h(y)i where (#> is some 
given function, we obtain an equation F(y) = 0, which we shall 
suppose to have roots yi, ya,. . . corresponding respectively 
to the roots O!^ o!^, . . . This transformation is to be chosen 
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so that the root yx bears to the other roots y 2 , ys* • • < ratios 
which are small in absolute value. The root yx is then said to 
be isolated. For this purpose we may use, for example, Horner's 
process, or the root-squaring process or inverse interpolation. 

III. Calculating the root of the transformed equation .— 
This may be done rapidly by one of the iterative methods, e.g. 
Newton's (§ 44), or Bernoulli's (§ 52), or by Whittaker's series 
(§ 60). Having found yx, the required root of the original 
equation is given by the formula xx = 

62. The Numerical Solution of the Cubic. — For the 
solution of cubic and quartic. equations some special methods 
are available which cannot be applied to equations of higher 
degree. To one of these, known as the trisection method of 
solving the cubic, we sliall briefly now refer. The term in x^ 
may be supposed to have been removed in the usual way, so that 
the cubic may be written in the form 

Q?-qx-r- 0 . 

There are two principal cases to consider: 

(i.) If 27^2 >4^®, the cubic has one real root and two complex 
roots, which may be found by Cardan's formula 

» = {Jr + (Ir* - ^V2*)¥ + (1) 

or else (if q and r are both positive) by finding <l> such that 

, , 3? r 

C08h<^=2 

when the real root is given by formula (1) 

®=J^^C08h^^, (2) 

or (if q is negative and r positive) by finding </> such that 
. , ^ 35 r 

\rhen the real root is given by the formula 

2 

»=^(-#Binh 


( 3 ) 
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We can always suppose that r is positive, since c hanging the 
sign of r merely changes the signs of the roots. 

(ii.) If 27r2<4g^, the cubic has all its roots real In this 
case we find the smallest positive angle <}> such that 

, 3« r 


when the real roots are given by 

2 1 <f> 

x...^3,*oos| 

2 , V — d> 

X,= -^qicos 

2 1 TT + (/> 

-7* COS- - 


- 


3 

Ex. 1 .—Consider the equation 

a:3+9a;2+23a;+14 = 0. 


(4) 


Writing a: = 2 / — 3 to remove the second term, the equation becomes 
^-4y-l = 0. (1) 

There are three real roots since 4.4®>27.1®. 

We have at once 

cos = Stand log cos ^ = f log 3 — log 16 = log cos 71° 2' 6e'’4, 
so that 1^ = 23° 40'68"-8. 


One value of y is now obtained by writing 

log Di = log 4 — ^ log 3 + log cos 23° 40' 68" 8 
= 0-3262913 
= log 2-114907, 

BO y, = 2-114907. (2) 


For the second root we liave 

log ( - Vjj) = log 4 - log 3 + log cos (60° - 23° 40' 68"-8) 

= 0-2697010 
= log 1-860806, 

so 1 ^ 2 = —1‘860806. (3) 

Lastly, 

log (- Vs) =log 4 - i log 3 + log cos (60° + 23° 40' 68'-8) 

= 9-4060073 
= log 0-2641016, 

so y,=-0-2641016. 


(4) 
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The required roots are therefore -0’885093, -4-860806, -3'2641016. 
Ex. 2. —Solve the equation 

(e*-f3!B®-|-6ai-27 = 0 

hy the trieeetion method. 

63. Graphical Method of solving Equations. —^We shall 
now consider the solution of an equation/(a;) = 0 by a graphical 
procedure. We first write the equation in the form 

/i(®) =/*(*) 

(as may usually be done in several ways), and draw graphs of 
the equations y=/i(*), y—ft{x). For example, to solve the 
equation - 4a; -f 6 = 0, the curves might be taken to be yi=ofi, 
gt - 4a; - 6. The abscissae of the points of intersection of these 
curves evidently satisfy the equation/(x) = 0, and are thus the 
roots of the given equation. 

Let one of these abscissae, as measured on the graph, be 
denoted by ai; we must, of course, recognise that aj is only 
an approximation to the root, since the accuracy of the reading 
is limited by the usual defects of graphical representation. 
To overcome this difficulty we now repeat the drawing of the 
poi'tion of the graph near the abscissa a; = ai on a larger scale. 
From the new diagram we find a new value x - oa such that aa 
is a closer approximation to the root than ai. Proceeding in 
this way, we form a sequence of values ai, aa, as, . . ., which 
approach continually closer to the correct value of the root of 
the equation. 

Ex.—Find the real root of the equation 

coexi^x 

eorreetly to eeven plaeee of deeimah, having given 

cos 42® 20' 40' = 0*739108,82, 

COB 42® 20' 60' = 0*739076,16, 
cos 42® 2 1' 0' = 0*739043,50, 
and «*/e48000 = 0*000004,848137. 

Suppose x^y", then the given equation becomes 
0*000004,8481 Z^y = cos y. 

We first plot the curves 

/j(y) - 0*000004,848137y'l 
/|W“COsy /» 


( 1 ) 
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using the following data ; 

42° 20' 40' = 152440", 0-000004,848137 x 162440 = 0-739060,0, 

42° 20' 60' = 152460', 0-000004,848137 x 162460 = 0-739098,5. 

Fig. 12 suggests as a solution the value y = 42° 20' 47'-3, so the first 
approximation for y is 

01=0-739084,98. 

Now cos 42° 20' 47"-2 = 0-739086,30, 

cos 42° 20' 47"-3 = 0-739084,98, 

and 0-000004,848137 x 162447-2 = 0-739084,91, 

0-000004,848137 X 152447-3 = 0-739085,40. 




Fig. 13 gives as a solution y=42° 20' 47"-248, and the value of the 
cosine is then 0-739085,14. The second approximation to the root is 
therefore 

ag= 0-739086,14, 
which is the required value. 
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64 Nomography. —Nomography is a special kind of graph¬ 
ical calculation which differs from other graphical methods 
in this respect: that with graphical methods generally it is 
necessary to draw a fresh diagram for every problem that is 
to be solved, whereas in nomography it is possible to use the 
same diagram over and over again for the solution of different 
problems, so long as these problems belong to the same type, 
and differ only in regard to the numerical data occurring in 
them. For example, there are many well-known graphical 
solutions of the quadratic equation in which the roots are 
obtained as the intersections of a circle with a straight line. 
The practical objection to these solutions is that a fresh diagram 
has to be drawn for each particular quadratic equation that 
has to be solved; and the time occupied in constructing the 
diagram is greater than the time required to solve the equation 
by the ordinary arithmetical method. This objection no longer 
applies if the diagram is of such a nature that when once con¬ 
structed it can be used for any quadratic, whatever be the 
values of the coefficients. Such a diagram, in which the 
construction is made once for all and is applicable to any 
number of special cases, is called a nomogram. 

The detailed treatment of the subject, which is an extensive 
one, is somewhat beyond the scope of the present book; the 
reader may be referred to the works of M. d'Ocagne* and 
S. Brodetsky t; we shall confine ourselves to illustrating the 
fundamental idea of nomography by describing a simple 
nomogram,! by means of which quadratic equations may be 
solved at sight. 

The accompanying diagram consists of two rectangular 
axes and a circle touching one of them (which we may regard 
as horizontal) at the origin. The circle may be of any arbitrary 
radius R The horizontal axis is graduated so that the gradua¬ 
tion p on it is at a distance 2R/p from the origin, and the 
other axis is graduated so that the graduation p on it is at a 
distance 2R/(l-jp) from the origin. The graduation at any 

• Train de rwmographie (1809) and Calcul graphique et nomographie (1914). 

t A First Course in Nomography (1920). 

j E. T. Whittoker, Edin, Maih, Soe, Notes, No. 19 (Deo. 1915), p. 216. 
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point of the circle is the same (with sign reversed) as the 
graduation at that point of the horizontal axis which is derived 
from it by projection from the highest point of the circle. 



Fio. 14.—Nomogram for the Solution of a? + aa; + 6=0. 


The method of using the nomogram is as follows: Let the 
equation to he solved he 

+ aa? + 6 = 0. 

Find the point on the horizontal axis at which the reading is a, 
and the point on the vertical a/xis at which the reading is h. 
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Imagine these two points joined hy a straight line {e.g. by 
Btxetohing a thread, between them, or by laying a straight-edge 
across). Where the line meets the cirde, read off the graduations 
on the circle: these are the required roots of the quadrctiic. 

The proof may be left to the reader. 

If the line is not conveniently situated on the diagram, we 
may replace the given equation, e.g. by the equation whose 
roots are the roots of the given equation with signs reversed, 
or the equation whose roots are the roots of the given equation 
multiplied or divided by 10 or some power of 10. 

Miscellaneous Examples on Chapter VI 

1 . Find the positive root of the equation 

a; + a* + a;* + = 5. 

2. Find the positive root of each of the following equations 

(a) ajlog«aj«18, (P) loge (a; + l)/(a; - 1) =* 2i>i 

Z. Find correctly to four significant digits the positive root of the 
equation 

a^-f 6a;= 1000, 

using either the Newton-Raphson method or the method of iteration. 

4. Compute to seven places the positive root of 

a;^ + 28a;^ = 480. 

5. Locate roughly the real roots of the equation 

aJ* - 3x2 + 75a; - 10000 = 0, 

and calculate one of them by the Newton-Raphson method to four places 
of decimals. 

6 . Show by the Newton-Raphson method that a root of the equation 

y* + (i(a; + a)y - a;® - 2a® = 0 

__« . . 509a^ 

“4'^64(*'*'672aa'*'ie384o» • 

(Wallis, Algebra (1685).) 

7. Solve the equation 

a;®+2a;®-a^-a; - 631064798 « 0 
by Homer’s method. 


(De Morgan.) 
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8. Apply the root-squaring method to approximate to the roots of 
the following equations to three significant figures: 

(a) 2a®-a;2-7a;+5 = 0, (p) + - 6x-i-2 = 0. 

9. Find by the root-squaring method the numerically greatest root of 
the equation 

4a4 + 128a^ - 96a^ 4 - 24® - 2 = 0, 

and determine the nature of the remaining roots. 

10. Show that the root-squaring method may be applied to determine 
the coincident roots of the equation 

a4 - lOa^ + 35®2 - 50® + 25 = 0. 

11. Determine z from the equation 

z = 6 Bin (n + z), 

where e is the eccentricity of a planet’s orbit» 
n is the mean anomaly, 

having given the values e = 0*208, n = 30*^ = 0*5235988. 
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CHAPTER VII 

NUMERICAL INTEGRATION AND SUMMATION 

A 

65. Introduction. —In the present chapter we shall show 
how to calculate the numerical value of a definite integral 

/ 

when f{x) is a function whose numerical value is known for 
values of x between a and 5. 

In works on the Integral Calculus it is shown that the 
above definite integral can be found, provided we can first 
find the indefinite integral of /(»), say F(aj): for the value 
of the definite integral is then F(6)“F(a). This method is, 
however, of very limited application: for even when f{x) is 
given as a compound of the well-known elementary functions 
^9 ^9 log®^ its indefinite integral cannot in general be 
expressed as a compound of a finite number of these functions: 
and when f{x) is not given in terms of the known elementary* 
functions, but is merely specified by a table of numerical 
values, the indefinite-integral method is altogether inapplicable. 
The methods described in the present chapter, which are of 
general application, are therefore of great practical importance. 

The problem of calculating the sum of a sequence 

«l + % + «^ + W4 + . . . +Un 

is closely connected with the problem of numerical integration, 
and is therefore included in the discussion. 

66. The Approximate Value of a Definite Integral.— 
A detoite integral 

J /.a+rto 

I f{x)dx 

a 

132 
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may be regarded as the measure of the area included between the 
curve y=^x), the ordinates »=o and x=a+no, and the axis of x. 

I^et P and Q be the two points on the curve whose abscissae 
are a and a+rw respectively, so that the area in question is 
FLMQ in the diagram. 

Let the base LM be divided into r parts each of length w, 
at the points U, V, W, . . and let the corresponding points 
on the curve be H, J, K, . . . Then we have 

area of quadrilateral PLUH= +J{a + w)}, 

„ „ HUVJ = \w{f{a + w) -^/{a +2w)}, 

etc., 



0 L U V W M r 

Fig. 16. 


and the sum of the areas of the quadrilaterals PLUH, 
HUVJ, . . . 

= w{+yi(o+w) +/(®+2w) +... +J{a +7^'. tp) + jyi(o+w)}. 

This sum may be regarded as a rough approximation to the 
area PLMQ, so that we have 

f{x)dx=lf{a)+f{a+w)+f(a + 2fu>)+. . . 

+/(o+f^l. w) + J/l[o+r!P) + T, 

where T denotes certain correction-terms. 

We shall now show how these correction-terms may 
be found. 
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67. The Ealer-Madaurin Formula. —Let us consiiler the 
case in which the function y(a!) is e®* where v is independent 
of X. Then the above formula becomes 


■| M+no 

-I e«'(fo= 4 e»“ + e'<“+")+«’<“+**’>+. . 

Wo 

+ J««»+«»w+T, (1) 

where T denotes the correction-terms; or (performing the 
integration) 

= e*” {1+e®"++Je®“(e®®"-1)+T 


pvatpww _ 1 \ 


BO 


+ i + - 


Now we have 

1 1 


( 2 ) 


s'-! 




, 0 \ 
i+2 + g+... ) 


-1 


0 + 2j + ^l + . . . 

'terms in 0*, etc.^ 

'5"2'^12^“720^'^ ■ ’ ‘ 


It is customarj to write this expansion 


1 


0 2^21 41 6 ! 


( 3 ) 


where the numbers B, which are called the BemouUian 
numbers, have the numerical values 


B, 





1 B B- ^ B 

30 ’ ® 66 ’ 


m 

2730 ‘ ’ ■ 


Comparing equations (2) and (3) we have 
T = e®“(6®®® -1)^ - 

Now if /(as)=6®*, we have /'(a5)=w®*, /'(as)*®®®**, , . . and 
therefore when/(a!)=«®*, we have 
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T= - ^{/(a + rw) -/(a)} + ^{/"{a + m) -/"{a)] 

+ . .. 

SO that in the case where f{x) = e'^, we have the formula 

I M+rw 

-j Ap )^=‘+/(<*+«») +y(o+ 2w) +... 

+ f{a + T -1 ,w) + \f{pb+rw) 
-^{fifl+rw) -/(«)} + H -/'"(«)} 

-/''(»)}+. . . W 


or, as we may more briefly write it, 

^jT = (|/o +/l +/2 + . . . +/r-l + |/r) 




(5) 


Now let /(a?) be an arbitrary function of x. We shall 
assume that it is possible to represent f{x) between the finite 
bounds aj==a and x=a + rw to a sufficient degree of approxima¬ 
tion by means of a sum of terms of the form Ae’’*, where A 
and V are independent of x : say let 

f{x)=Ayf^+A^e^^+A^e^+. . 

where A^, Aj, A 3 , . . «?& . . . are suitably chosen 

constants. 

Applying formula (4) to each term of this sum, and adding 
the results, we see that the formula (4) or ( 6 ) may be applied 
to such an arbitrary function f{x) in general. It is known as 
the Euler-Maclaurin formula, having been discovered inde¬ 
pendently by Euler and by Maclaurin in the years 1730-1740.* 

riw rfa; 

Ex,—To compute I — correctly to eight places of decimals, 

Jm ® 

In the Euler-Maclaurin formula put a=100, w=l, r =* 6, /(as) = -• 

* In both cases the publication was several years subsequent to the 
discovery. Cf. Euler, Comm. Acad. Sd, Imp, Petrop, 6 (1738), p. 68, and 
Maclaurin, Treatise of Fluxiorhs (1742), p. 672. 
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Thus 

- JL4.JL JLAJL 

/ioo a; “2 100 102 *^103 ■*"104 “^2 105 

_1_/ Jl_ 1 \ J / ^ _i_\ 

"l2\ 1052*^ lOOV"^ 120 V 105^‘*'l00V'^’ ‘ 

= 0-005000 000 _ J_/ 0-000100 000\ 

0-009900 990 12 \ - 0-000090 703/ 

0-009803 922 
0-009708 738 
0-009615 385 
0-004761 905 

= 0-048790 940 - 0-000000 775 

= 0-048790 165 

The calculated value of the integral is therefore 0-048790,165. 

This result may be verified by computing log^ (105/100) from the 
logarithmic series, as follows : 

5^^1 52 I *"^ 4 - 

100“ 100 2 TUO^’*’ 3 1002 4 100^"^ 5 VOO® 6 100®' ’ 


0-050000 000 
+ 0-000041 667 
+ 0-000000 062 

0-050041 729 


-0-001250 000 
-0-000001 562 
- 0-000000 003 

-0-001251 565 


= 0-048790,164 

which agrees with the previous result to one unit in the ninth place of 
decimals. 

68. Application to the Summation of Series. —The Euler- 
Maclaurin formula is often used in the computation of sums, 
when the integral which occurs in it can be calculated by 
other methods; the method will be obvious from the following 
examples: 

Ex, 1.—If it is required to obtain correctly to nine places of decimals 
the value of 

1 1 1 1 
2012'^ 2032"^ 2052 • •■^2992’ 


we have from the Euler-Maclaurin formula with a = 201, w=2, r = 

/(x)*l/*2 

— — 1—~ 

sJm 'ar®”2 201*'''203*'''' ‘ 299* 2 301® 


1/ 2_ _2\ 
6 \ 301 ® ■*' 201*7 


60 , 
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so 


201 * 


+ . 


1 ^1/ 1 1 \ 1/1 
’299® 2\201 301/21,201* 


■ 301*) 


1 / 1 _ 1 \ 

■*■ 3 ^ 201 * 30iV’'’' ■ 


= 0-000826 4326 
0-000006 8575 
0-000000 0288 


= 0*000833 319, which is the required value. 

Ex. 2 .—To compute the sum of the infinite series 

_1 L_ 1 1 

101*'^l‘03*‘''lb6*‘’'r67*‘'’’ ■ ■ 

If ill the Euler-Maclaurin formula we put /(^c)=l/5c^ r^oo, 

we have 


L-J_ 1 

2 a 2a^ ^ (a + 2)^ 


1 

■•■(a+ 4)* ■*■(^6)* 


J_ 4 _ 

■~3a*'^l5o« ■ 


SO that 


1,1,1, 
a*'*'(a4-2)*'^(<i + 4)*'^' 
Putting a= 101, this gives 

1 1 1 

1012*^ 10^2 Top ' 


• 2a’^2tt2“^3a» 


1 


2.101 2.1012’^ 3.1013 “ 15.101® * • * 

= 0 004960,49505 - 0-000000,00003 

+ 0*000049,01480 
+ 0*000000,32353 

= 0*004999,83336, which is the required sum. 
Ex, 3 .—Calculate correctly to ten places the sum of the infinite series 

_i J- J. „L 

n*'*' 12* ■^13*''’14®'*’' ■ ’ 


69. The Snms of Powers of the Whole Numbers. —The 

sums of the first, second, and third powers of the first n whole 
numbers are, as is well known, 

1 +2 +S +. . . + n = jTO*-fiTO, 

1* -H 2* 3* -I-. . . -f n* = + ^n* + ^n, 

18 2* + 3* +. . . + «® = J«® + Jm*. 

We shall now show that the Euler-Maclaurin formula 
enables us to find readily the sum of the ^th powers of the 
first r whole numbers, where p is any positive whole number. 
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For taking a = 0, io = 1, j\x) — xp in the formula 

/(»)‘^ = (^0+/l+/2 + • • •+/r-l+|/r)-^(/r'-/o') 


we have 

rP+i 1 1 

^-lP+2i> + 3P + . . . + {r-l)P + ^rP-j^-P-^ 

and therefore 

li> + 2i» + Si* +. . + rP = + — + S-rP~'^ - ^(P_Z^jj£rj)yg-» 

jp + 1 2^12 720 

p[p - l)(i? - 2)(jJ - 3)(y - 4) 

30240 “ • ‘ 


the last term being that in r or r^. 

This is the general formula for a sum of positive powers: it 
was first published in James Bernoulli’s posthumous work, Ara 
Conjeetandi, in 1713.* 


Eon.—Show that the sum of the 1th and 6th powers of the first n whole 
numbers is double the square of the sum of their cubes, 

Wc have at once 

l'^+2’ + 3^ + . . . + ii7 = |n® ++ + 

Similarly 

l® + 2® + 3® + . . . +w® = ++ 

We have therefore only to prove that 
(i»® + in’ + tV® - + tV*) + (in« + Jn® + 

= Mn + l)* 

which is evidently true. 

70. Stirling’s Approximation to the Factorial.— In the 
Euler-Maclaurin expansion 

ijr''"'>(®)(fe = (^0+/l+/«+. . .+/r-l + |A)-||(/r'-/o') 

+ ^C/r"'“/o"0-- • • 

' ♦ Ars Conj, p. 97. 
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write w = 1, f{x) = log x. The formula then gives 
1 

loga!<fa! = ^loga+log(o + l) + . . . + log(o+r-l) 


or 


(o + r + i)log(a + f)-^a + i) loga-r 


= log {(a+ !)(« +2) . . . (a + r)} 




12Va + r 




1/1 1 \ 

■^360V(a + r)»“oV“’ ‘ ’ 

Writing n for {a + r), this becomes 
(« + i)log»-(a + i)loga- ra + a = log (m !) - log (a !) 



^ ( 


iVn a 

) 360' 

.m» aV ' • * 


We have therefore 

log(«!) = C + (n + i)log«-« + jL_^ + . . . 

where C is independent of n. 

In order to determine C, we refer to Wallis' formula 


(1) 




'2 2 4 4 6 6 

s's's’s’r 


2n 


2n 


or 

and therefore 


—) 

2n -1 2« +1 / 
2<«(m!)« 


‘2 nSJ-{(2M)!}*(2» + l)' 


log H = Lt [4» log 2 + 4 log »! - 2 log (2m!) - log (2n +1)]. 

° 2 n^oo 

Substituting for the logs of the factorials from (1), we have 
log^= Lt ^ 4 mlog 2 + 4|^m + i^logm-» + cJ 

- 2|(2m + log (2ra) - 2m + C J - log (2m +1) J 

- -21og2 + 2C, 


BO 


C=.ilog(2,r), 
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and finallj 

log(n!) = (« + i)log«-» + ilog2^ + j^-3g^ + . . . 
or 9i! = 7i^+W27r-e“"^l +j^ + . . 


This formula is due to Stirling:* it is of great value for 
computing the factorials of large numbers. 

Bx, 1 ,—To commute log i^j(79!). 

In Stirling’s formula, replacing log(wl)by {log(7i-l)! + logn} we 
obtain 

m(» - 1 ) 1 = log «(» +1 log io(27r) 

f 11 1 1 \ 

+ M| i2n aeOn®'*'1260 m* leSOn’'*'' ’ ’/ 

where M = lagig«=. 0-434294,481903,251828. 

Taking m = 80, lagig80» 1-903089,986991,943586 

And 79-6 logio80= 161-296653,965859,515 

then 

logx«(79!) - 79-6 log „80 + -I log3o(2>r) + 

- M^80 + ggQ qqs + ie8o.8o0 


= 151-296653,966859,515 
0-399089,934179,058 
0-000462,390086,316 
0-000000,000000,105 
= 151-696196,290123,994 
= 116-961637,736607,649. 


- 34-743658,562260,146 

- 0-000000,002366,198 

- 0-000009,000000,000 

- 34-743668,564616,346 


This is the required value correctly to fifteen places. 


Ex. 2 .—Show that 

ijf f(.x)dx=(f^+f^^■f^ 


+ . . . +fr-.\) + 
7w* 


24 


6760 




(//-/o') 
967680 


(//-/o^- 


71. The Remainder Term in the Euler-Maclaurin 
Expansion. —We shall now investigate the error committed 
by truncating the Euler-Maclaurin expansion at any term.t 

* Methodw differentialis (1730), p. 137. 
t Jacobi, Jo%tm,filr Math, 12 (1834), p. 263. 
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Lot a set of polynomials <^(<), ^{t), ... be defined 

by the statement that is the coefficient of 8”/m! in the 

expansion of s. in ascending powers of s. We readily 


<#> 1 ( 0 =^. + _ 

The polynomials are called the ^polynomials of 

Bernoulli* 

From the defining equation 
*•^-1 

we see on putting ^ = 0, and ^ = 1 successively, that all the 
Bernoullian polynomials vanish when t = 0, and all of them 
after the first vanish when t-1. 

Moreover, by differentiating equation (1) with respect to t, 
we see that 

nto(w + l)! nti n\ 

Kemembering that 

s s ^ 

and equating coefficients of powers of s, we have 

= ^i(0 " h> (2) 

= ^2(0 + ( 3 ) 

(4) 

T ^ b {^) = <#>4(0 " ^ 2 » (^) 

and so on. 

Now denoting \)jf{x) an arbitrary function, we have 

r +u* rl 

f{x)dx = wj f{a + wt)dt 

* The name was given by Raabo in honour of James Bernoulli, the author of 
Ars Conjectandi, 
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and, by repeated integration by parts, 

f{x)ds = 2 {/(«+«’) +/(®)} - -l)f{a + wt)dt 

=f {/(« + w) +/(«)} - /(® + 

= |f/(a w) +/(o)} + ^ <h(i)/'(a+wt)dt. (6) 

But by (3) we have 

= - 

= - ^(/(« + «>) -/(«)} - /"{» + wt)<t>t'{t)dt 

= - ^{/(« + w) -/(a)} + C^) 

On substituting from (7) in (6) we have 
jT yi[a:)rf®=|{/(o+ w) +y](o)} - -^{f(a + m>) -j '(a)} 

+ 4>t{t)f''{a + wt)dt. (8) 

But by (5) we have 

“^{n^+w) -/»} 

=-/"(«)} +^{ Ut)r{a + y>t)dt. (9) 
Substituting from (9) in (8) we have 
jT =|{/(a + tp) +/(a)} - ~ {/(o + w) -/(«)} 

+ 5^{/> + «) -/'W + yJ^ ^(<)/»>(a + tp<)rf<. 
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This is the EvleT-Madaurin formula with a remainder-term 
expressing the error committed when the series is truncated at the 
terms involving the third derivatives. By proceeding in the 
same way we can find the error committed in truncating the 
Euler-Maclaurin expansion at any assigned term. 

If y][aj) is a polynomial, the Euler-Maclaurin series terminates 
and is exact. 

72. Gregory’s Formula of Numerical Integration.— 

The earliest formula of numerical integration to be discovered 
was one in which the correction terms were expressed in terms 
of differences instead of derivatives. This formula, which is of 
great use in practice, may be derived from the Euler-Maclaurin 
formula in the following way: 

In the Euler-Maclaurin formula 


"jT = (|/o+/l+/2 + • • + j2(//-/o') 






let US substitute for the derivatives from the equations 

wf^' = A/o - iAyo + W/o - iAVo + iAyo -. . . 

wfr' = A/.-i + + iAy,.4 + iAy.s +... 

«yo'"=Ayo-fAyo+iAyo-... 

= Ay,.3 + |Ay_4 + Wr-6+. . . 

«;yo''=Ayo-... 

«y/=Ay-6+... 

etc. 


It may be remarked that the formulae for the derivatives of are 
given in § 35, and that the formulae for the derivatives of fr may be 
obtained by forming a difference-table in which the entries are reversed 
in order, ie.• • •> /i./o> applying the equations 
of § 35. 

Thus we obtain 

+/i +fz + •' • +/r-i + 2/r) ~ j2(A/r-i - A/o) 

- +A%) - ^(A>/;-, - A%) - j|5(Ay,..+Ay.) 
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This formula was discovered by James Gregory;* like his 
formula of interpolation (§ 8), it does not require ordinates 
beyond the limits and fr in order to form the differences. 

On. taking/(a) = e®*, «= 1, a = 0, the formula becomes • 

^0 _ 1 2 
-= (1+«» + «*'+. . + 1)-V5(1-e”*)*” 

V 2 


whicli evidently breaks up into the expansions 


1 

V 




and 


1 

V \—e 




24' 


19 

'm' 


(1 


which are the expansions of 

1 ,_ 1 

log {H-(e®-1)} ““ log {1-(1-«-“)} 

respectively. The nuinerical coefficients in Gregory's formula are therefore 
the same as those which occur in the expansion of - {log (1'-»)}”^ in 
ascending powers of x. 

That this must be so may be seen at once by symbolic reasoning; 
for if p denotes the operation of differentiation, so that denotes 
integration, we have (§ 36) 

e«'» = E = l +A, 

and therefore 

Vi=_i__ z} _, 

w log(l+A) log(l—AE 1) 


whence Gregory’s formula may be derived symbolically. 

Expressions for the remainder after n terms of several formulae of 
numerical integration of this type have been found by H. P. Nielsen, 
Arkivfihr Math, Astron, och Fysik^ 4 (1908), Nr. 21. 


Ex, 1 , —Show that the coefficient of - {A^/r_n± Gregorys 

formula is 


(-!)» 


1 0 0 0 ... 
1 0 0 . . . 
Jr J 1 0 . . . 

i i i i--- 


[(n +1) rows] 


an4 that it may also be written 


/ 1)(®"-2) . . . (os-n + l), //N, :, ^ 

(-!)»/ - - - - - -^- 'dx. (Glaisher.) 

Jo 

* Letter of Gregory to Collins of date 23rd November 1670, printed in 
Bigand's Correspondeneet 2 , p, 209. 



NUMEEICAL INTEGEATION AKD SUMMATION 145 


Ex. 2.—To calculate 

Jm « 

correctly to seven places of decimals. 

By Gregory's formula, the integral is equal to 

i tJif + wr + lia + lis + to jr + h lir “ i^aCA/inw “ A/ioo) 

“ j4T(^%03 + ^%oo) ■“ “ ^%oo) “ • • • (1) 



/■ 

A. 

A®. 


11 

.g 

H 

0-010000 00 

-9901 



1 _ 
T (FT 

0-009900 99 

-9707 

194 

-5 

1 _ 
Tffa — 

0-009803 92 

-9618 

189 

-6 

1 __ 
TTJ^ — 

0-009708 74 

- 9335 

183 

-6 

1 _ 
T<rT- 

0-009616 38 

-9158 

177 


1 — 
1 O S ““ 

0-009623 81 





Substituting in (1) tlic required differences, we have 


T05 

0.005000 00 

100 X +0.009900 99 

+ 0.009803 92 
+ 0.009708 74 
+ 0.009615 38 
+ 0-004761 90 

== 0-048790 93 


- - 9158 + 9901) - ^\{177 + 194) 


-62 -15 


= 0-048790 16 


The required value is therefore 0*0487902 correctly to seven places 
Ex. 3. —Given the table 

X. f{x). 

1 287626,699801 

2 287757,439208 

3 287888,218227 

4 288019,036864 

5 288149,895125 

6 288280,793016 

7 288411,730543 

compute f(x)dx by the Gregory formula. 

(d3U) 6 
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Ex. 4 .—Calculate 

r 

725 ® 

*73. A Central-Difference Fonnnla for Ntunerical Integra¬ 
tion. —The formula of Gregory 

ijr^’"/(®)c7® = (^/o+/x+/2 + - • •+/r-l+|/r) 

- - A/o) - ^(Ay,., + Ayo) - ^(A%.3 - Ayo) -... 

utilises differences which lie in lines sloping towards the centre 
of the difference table. Evidently there must exist a corre¬ 
sponding formula in which central-differences are used, and 
which will therefore be of the form 


1 ra+rw /I 1 \ 

-= ( 2 *^® + • • . +/r -1 + 2 /r) 

4 . + A/y-1 _ AZ::! j 

+ B{A%.1-Ay.i} 

^^fAy,,i+AV,,2 ^ Ay,i + Ay,2 j 

+D{Ay.2-Ay.2} 

+... (1) 


where A, B, C, D are coefficients as yet unknown. To 
determine these coefficients, we may transform the derivatives 
in the Euler-Maclaurin formula, or more directly, we may 
write/(x) = 6«*, w = l when formula (1) becomes 


e(a+r)v _ ^av 
V 


or 

or 


fi(a+ryv _ M 1 

— i—r - e“*} + A sinh - e“®} 


1 

V 


1 

eO-1 


+ 2 + A8inho + . . 



1_/1 1 

t >“\»~ 2'^12 720 




+ C(v® +. . •) + • • • 
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Equating coefficients of powers of v we obtain 
A.-^, B-0, 
and thus we have the formula 

wla “ (2-^® + . . . +/r-l + 2/r) 

If A/, + A /,- 1 _ A/o + A/-i \ 

121 2 ' 2 / 

. Il/A»/,.1 + Ay,.2 A»/.i + A®/.2) , 

■^7201 r 2 

This formula, which is extensively used by astronomers in 
the calculation of perturbations, is in general more rapidly 
convergent than Gregory’s; but it involves values of / outside 
the range of integration. 


Ex. 1 .—Show that the above formula may be obtained by integrating 
with respect to 71 , between the limits 0 and 1, the interpolation-formnla 




A»/.i + 


w(»i® - 1 ) (n - 2 ) A*/_ 2 + 


3! “■/ 1 - 41 

Ex. 2 .—To cakulate ir from the formula 


+ . 


_ c 

~Jo 1 +** 


Taking r= 10 we have 

jo 1+*® 2 2-00■^1-81 ■^1-64 1-36 ■^1-26 ■’’l-ie'*’1-09 

1 11 lf A/,o + A/, A/,+A/.., ) 

1-04 1-01 2-00 121 2 2 J 

+ 11/^% - _ A®/-i+A®/.., l 

■*■7201 2 “ - 2 /+••• 


[Table 
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/- 

A. 


A». 

II 

e« 

L =0-9616386 
1-04 

285605 



M 

II 

-l-= 0-9900990 
1-01 

99010 

- 186596 

-11425 

fo = 

-4-= 1-0000000 
1-00 

- 99010 

- 198020 

11426 

fi = 

-i- = 0-9900990 
1-01 

- 286606 

- 186595 


fz = 

• 

^ =0-9615386 
1-04 




• 

/s = 

-i-= 0-6097661 
1-64 

- 672699 



/» = 

_L = 0-6624862 
1-81 

-524862 

47837 

1912 

II 

^=0-5000000 

2-00 

-475113 

49749 

-1162 

/u = 

^ =0-4624887 
2-21 

-426526 

48587 



/., =—L= 0-4098361 
2-44 


We have therefore, neglecting differences of order higher than the third, 



dx 

l+x^ 


= 10 


TT 

4 


= 0-2600000 
0-5524862 
0-6097561 
0-6711409 
0-7352941 
0-8000000 
0-8620690 
0-9174312 
0-9615385 
0-9900990 
0-6000000 


- ^ - 499988 - 0) + ^(376 - 0) 


7-8498160 +41666 + 6 

«■ 7-8539829. 
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Therefore 

7r= 4 X 0-78639822 = 3-141692,88 
whereas the correct value of tt is 3-141592,65 . . . 

74. Lubbock’s Formula of Summation. —In the Euler- 
Maclaurin formula 

1 ra+nv 1 ,,, 

= (/O +/l +/* + . - . +/r) - l (/0 +/r) - ^{fr -fo') 




-/o'")-- - - 


suppose that each interval is subdivided into m parts: the 
Euler-Maclaurin formula corresponding to the division is 

mm ^ 


Eliminating the integral between these two equations we have 
^ 4-/2 4-/3 +. . . 4*/y = w(/o4-/i4-/2 4-. . . 4-/r) 




f 4 . f \ ^ ( m^ ~ 1) / ^ /x , _1 If'"^ 

“^ “vAVo) '^2m m® 720^'^^ 


This formula enables us to deduce the sum of a large number 
of values of the function taken at any interval from a smaller 
number of values of the function, taken at intervals m times 


greater. 

If the derivatives in this formula are replaced by their 
values in terms of differences, as in § 72, we have what is known 
as LulibocKs formula* 

fo +/l +/l 4- . . . 4-/r = ^^(/o 4-/i 4-. . . 4-/r) g (/»* 

- 


* J. W. Lubbock, Ctemft. PAi7. Trans, 3 (1829), p. 323. The formula as 
given originally by Lubbock involved advancing differences of fr ; the formula 
here given, which does not require a knowledge of/« for values of x greater 
than r, is due to A. de Morgan, Diff, InL Calc,, pp. 317-318, § 191. It may be 
obtained readily by the use of symbolic operators ; cf. T. B. Sprague, J,I,A, 18 
(1871), p. 309. 
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Ex. 1 .—To cakulate the mm 

Let m=10, 10=10, and form the difference table of /(n) = ^, 


beginning with the value/g= 

/j = 0-00010000 

1 

1002* 




-1736 

416 



/j = 0-00008264 

-1320 


- 123 


/a = 0-00006944 

- 1027 

293 

-81 

42 

/8 = 0-00005917 


212 


26 

-815 


-55 


/, = 0-00005102 

-658 

157 



4 = 0-00004444 





By Lubbock’s formula we have 





150/1 \ ,9 99 

ITO~ • • +/8) “ +/o) - l 2 o(A/l- A/o) 


(A%+ Ayo)--?£>wJVA- A»/o) 


99 X 899 


(AVi + AVo) 


480000 

= 406710 - 64998 - 889-35 - 236-36 - 17-76 - 12-61 
= 340566. 

The required value is therefore 0*0034056. 

Ex, 2 .—Evaluate to seven places 

100 X 

Ex, 3 .—Show that Gregory's formula may he obtained by making m tn- 
crease indefinitely in Lubbock^s formula, 

75. Formulae which involve only selected Values of the 
Function. —If the function which is to be integrated is such 
that its differences beyond a certain order n may be neglected, 

r +rw 

/(»)<& in 

terms of the values of / and its differences as far as order », 
at selected values of x. But these differences can in turn be 
expressed in terms of selected values of f by reductions such as 

A/o =/i -fo, AYo =/2 - 2/i -h/o, etc., 
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f +ritf 

f{p^)dx entirely in 

terms of values of / at selected values of a?. Among formulae 
of this class may be mentioned specially the following,* which 
are accurate for functions whose fifth differences are constant: 


- A(/a + ^l/a+1 +/rt-f-2 + Q/a+3 +/a+4 + ^/a+6 ^/a+e)- 

(Weddle.) 

= 0-28( A +/.+6) + 1‘62(A+i +/,+,) + 2*2/,+3. 

-'a 


(G. F. Hardy.) 

f +io 

- 1 w{8(/a +/rt+io) 35(/rt 1-1 +/a-h8 +/a+7 +/«+») 

+ l^(/a+2 +/a+4 +/a+6 ^/a+s) + 

(Shovelton.) 

These formulae may be proved directly, without reference 
to Gregory’s formula, in the following way: 

Let it be retjuired, for example, to prove the second of them 
(G. F. Hardy’s formula). Since is a function whose fifth 
differences are constant, we can represent /a- by a polynomial 
of degree 5 in x, and therefore by a polynomial of degree 5 in 
the variable z, where » = cc - a - 3, say 

/»=/«+8 + “«+^ 2 * + T«® + ^ + «*®. ( 1 ) 


where a, y, 8 , c are independent of z. 

Therefore 

fjic=f (/„+3 + oiS + ^«* + 72 » + 82 ;« + ea*)rf2 

■'a ’'-3 

= 6/„+3 + 18/3 + ^ 6 . ( 2 ) 

Now let U 8 see if this can be represented by an expression 
of the type 

Mfa +/o+6) + ®(/»+l +/o+6) + 04+3* (3) 

where A, B, C are absolute constants. By ( 1 ), (3) may be 
written 

A(2/„+3 +18/3 +1625) + B(2/„+8 + 8^ + 328) + 04 + 3 . (4) 

Since the expressions (2) and (4) are to be identical, we may 

* A valuable discussion of formulae of this type is given by W. F. Shepiiard, 
JVoc. Lond. Math, Soc, 32 (1900), p. 258 ; and a detailed elementary treatment 
by A. E. King, Tram, Fac, Act, 9 (1923), p. 2ia , 
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equate the coefficients of fn+ 3 , And fi in them, and thus 
obtain the equations 

[2A + 2B + C = 6 
■ 18A + 8B = 18 

162A + 32B = 486/6. 

These equations give 

A = |g = 0-28, B = gi = l-62, C = 2-2, 
so (3) becomes 

0-28(/, +/„+«) + 1-62(A+, +A+s) + 2-2/„+3, 
and thus Hardy’s formula is established. 

Weddle’s formula may be deduced at once by adding the 

zero quantity the right-hand side in Hardy’s formula. 


Ex. 1 .—Evaluate 


( 


^ dx 


Weddle’s formula may be written 


r 


f 


" 3w 

^/a+w /a+2«» ^/a+sw ”l“/a+4M' ^fa+nw +/a+6w)* 


Let a = 0, wi = iV> aiid/a.= 1/ 
dx 1 

: = 7-a-000000 + 5-017452 + 1-014185 + 6-196773 


V(l-**) 40 

1-060660 + 5-500191 + 1-154700) 

= 0-5235990. 

The true value of this integral is or 0-5235987 correctly to seven 
places. 

Using Hardy’s formula we have 
/*' dx 1 

/ ^ 2-1547005)+ (1-62 X 2-1035285) 

* +2.2xl.0327955)} = 0.5235986. 

Ex. 2 .—Evaluate 

dx 

1 +^ 2 ’ 

using seven ordinates, 

76. The Newton-Ootes Formulae of Integration.— Among 
formulae of the type discussed in the last section, the oldest 


t- 
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and best known are the Newton-Cotes formulae^ which we 
shall now derive from first principles. 

Let it be required to calculate the integral of a given 
function between limits x = a and x = K By the transformation 




a+ 6 ?)-a. 

-r+-2-f 


we can reduce the integral to one in which the limits are -1 
and +1. Let Jiq, hi, . . ,,hn be numbers intermediate between 
- 1 and +1: we shall try to find a sum of the type 

+. . . + ^nAK), ( 1 ) 

where Hq, Hi, . . are constants, which will closely 

represent 

j A^)<i<»- (2) 

We are thus trying to represent the value of the integral by 
a kind of weighted average, in which weights Ho, Hi, . . ., H^ 
are attributed to the values of the ordinate/(a?), corresponding 
to the abscissae ho, hi, . . ., h^- In order to secure that the 
sum (1) shall represent closely the integral (2) we shall lay 
down the condition that the sum (1) is to be equal to the 
integral (2) exactly so long as f{x) is any polynomial of degree 
less than {n + 1). 

Denoting the product {x - 7/o) (a; - ^i) ... (a? ~ h^ by F(a5), 
we see that the sum (1) exactly represents the integral (2) 
when 




since this function is a polynomial of degree n. The sum (1) 
then reduces to 


Hy(7ty — Aq) (^r ^i) • • • 

where the factor {hr - h^ is omitted; and this may be written 


This gives at once 


H, Lt or H,F(/i,). 

E (^r)J.i X-hr 


a formula which determines the H's when the h’a are known. 

(d811) 6« 
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This formula might have been derived by remarking that 
when /][«) is a polynomial of degree less than {n +1), it may be 
represented aeeuratdy by Lagrange’s formula of interpolation 
(§ 17), which may be written 

The required formula is obtained from this at once by 
integration. 

Let us suppose that the h‘i are chosen at equal intervals 
apart so that 

Ao = — 1 , 7*1 = -1 + —, ^2 = -1 + —, . . = 1 . 

n n 

Then 

. . . (®-A,-0(®-Ar+i) . . . {x-K) 

= . . . ( 7 -»)}, 

where t = \n{x +1) and 

F (A,) = {hf — Aq) . . . {Jhj. — hf^ij (A, — 7ir+i) • ■ • (Ar — 7*„) 

We have at once 

where 

'(<-1) ■ ■ • +1) -»)*■ 

2%is is known as the Newton-Cotes formula of integration* 
More generally, writing = a + rw, we have 

f +nw 

Ax)dx = Ho/(a) + Hiyi(a + w) 

+ 02^(0 + 2w) + . . . + + nw), 

* Newton, Letter to Leibnitz of date 24th October 1676 ; Principia, 3 (1687), 
Prop. zl. Lemma 5; Cotes, Harmonia mensurarum (1722); James Gregory 
(for »»2)«in Exercit, geom, (1668). Cf. Bickley, Math, Gazette, 28 (1939) 352. 
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where 

^r 1 (n r)lj ^~ ’ ' ' (^ - r +1) (^ - r -1) . . . - n)dt. 

The values of the coefficients in a number of cases are given here 
for reference; 


For 71 = 1, w=l 
For 71 = 2, iv=^ 
For ?i = 3, w = ^ 
For 71 = 4, 70 = \ 
For n = b, 70=\ 
For n =6, w = ^ 


00 = 112= J, 01= 

Ho = Il3=^, IIi = Il3 = -J 

0 o= 04 =irV. 0i=08 = 1S.02=tV 
0O = 05=i^% 01 = 04=U. 02 = 03= *^ 


TT _ TT — ♦ 1 H — TT — ® TT — TT — « TT — 3 4 
"0 “ “ H fTr> ~ "6 ““ B y> -tig — XI4 — 113 — j 

Note tliat we must always have IIq + Hj + H 2+ . . . +H^ = nM;. 

A formula making use of the ordinates at the middle of the intervals 
(instead of the ends of the intervals) was given by Maclaurin, Fluxions, 
2 (1742), p. 832. 

An expression for the remainder after n terms in the Newton-Cotes 
formula has been given by J. F, Steffensen, Skandinavisk AktuarietidskTrift 
(1921), p. 201. 


Ex.—Calculate 


r' dx 
jol+SC 


hy the Newton-Cotes formula Tvith eight ordinates. 

Let 70 = \ SC) that 71 = 8, and write 

<^(Q = <(«_1) ... (<-8). 

The coefficients H.,. may be written 

^ (-1)^-^ 1 
r!(8-r)!8jo e-r 

and we have the values 

H _W— -N— TT — 2944.fJ^_- TI^ -- _ _ 4£4 . 

0 — tig — — ^*7 “ TT 1 T5 • ^2 ■” ^6 ~ TTTTX • 

_TT _ 6348 .TJ _ __ 464 

3=ll6-TTTTF •1^4- ~ir?rBBr- 

Since f{x) = we have 


1 6 

2 -S' 

2 8 

IS TJf 


8 7 

T S 
4 8 

T TY 


M 1 f t- A t A T * i 

14175fr 494-5 2944 -464 5248 -2270 5248 -464 2944 494-6 
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166 


Then 


f dx 

i.l+* 


d/sa 8 4*8 2 8 4 


192 48 192 2 1 

494-6 X 1-6 + • 2944 - • 464 -f ^ - • 6248 - ^ • 2270 

lo5 Jo 14a a J 


_L/ 

14176\ 

= 0-69314721, 

whereas the true value is 0-69314718, 


77. The Trapezoidal and Parabolic Rules. —The simplest 
cases of the Newton-Cotes formula are the following: 

1 ®. » = 1 . 

r -l-W 

J{x)dx = ^w/{a) + ^wj\a + w). 

This is known as the trapezoidal ride. It is exact when 
/(os) is a function whose first differences are constant. When 
f{x) is a function whose first differences are not constant, the 
difference between the two sides of the above equation may 
be written 

where 0<^<1. 


2 ®. 71 = 2 . 

J\x)dx = ]iiWf{a) 4- %wf(a ^w)-\- \wf{a + 2w). 

• a 

This formula was first given (in a geometrical form) by 
Cavalieri,* and later by James Gregory! and by Thomas 
Simpson t; it is generally known as Simpson's or the paralolic 
rule. It is exact when/(a?) is a function whose third differences 
are constant; when f{x) is a function whose third differences 
are not constant, the error involved in the use of Simpson’s 
formula, i.e. the difference between the first and second members, 
may be expressed in the form § 

where O<0<1. 

* Centuriadi varii prohlemi^ Bologna (1639), p. 446. 
t Exerdt. geom.^ London (1668). 

} MaJth. JHaaertationSf London (1748), p. 109. 
g Feano, Applkazioni geom. del calcolo injin.^ Turin (1887), p. 208. 
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3® « = 3. 


ra+3w 

I f{x)dx = fw{/(a) + ij{a + «>) + 3yi(a + 2w) +yi[a + 3w)}. 

Ja 

This is generally called the three^eigliths rule. 

In practice, when it is required to compute an integral, we 
first divide the range of integration into a number of intervals 
and then apply one of the above rules to each of the intervals 
and add the results so obtained. Thus when we use Simpson’s 
rule after dividing the range (say from a to a-\-2pw) into p 
intervals, we have for the value of the integral 

^wf{a) + fiwf[a + + ^wf(a + 2w) 

+ ^wf {a + 2u^ -I- + 3?/?) + + Aw) 

+ \u)f{a + Aw) + %ivf{a + bw) + lwj{a + 6?/;) 

+ . . . 
or 

\w[f(a) + 2{/(a + 2w) +f{a + Au^ +. . .} 

+ A{f{a + w) +/(a + iw) +. . .} ^-f(a + 2jnv)\ 

whence the form in which Simpson’s rule is generally stated: 
Form the sum of the extreme ordinates, twice the sum of the 
even ordinates, and four times the sum of the odd ordinates; 
and multiply the result hy one-third of the interval of the abscissa. 
Ex. 1. —Calculate 

f' dx 

Jo 


by the Simpson formula, using nine ordinates. 


Let f{x) = a = 0, and -m; = . 


0 


/(»: 


1 

■ft' 

H 

V 


1 

T 

4 

T 


3 

S’ 

8 

TT 


Then 


We have the values 


2 

IS 


5 

‘ft’ 

8 


i 

T 


X 

8 

8 

T5^ 


1 

1 

TS 




'8 8 
9*^11 


=—{1-6 + 4-076190 5 + 11-059518 3} 


24 

= 0-693154 5. 




The value obtained is 0-693156, whereas the correct^ value is 
log 2 = 0-693147, correctly to six places. 
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Ex, 2. —Show that if in evaluating the area between the ordinates 
«1 and x=s2 of the rectangular hyperbola xy = \ the formula of 
Simpson with 11 ordinates is employed^ the error is 0'000003. 

78. — Woolhouse’s Formulae. —A number of integration formulae 
akin to those which have been considered in the last three articles were 
given by Woolhouse in Journ, Inst, Act, 27 (1888), p. 122. He recom¬ 
mended the two following specially for their “high and persistent 
approximation.” 

[10 f 223, , 6875 , , 4625 , , 41 ^ 

[9-4486938](Mj + Mjo) + [0-2092449](Mj + it,) 

+ [0-3394319](it3 + itj) + [0-2626368]itj 

(where the numbeia in square brackets are the logs of the coefficients), 
and 

7 , V 16807, , 128, ^ 71 1 

M(i«-14|jgg(ll3 + 1i„) + ggggg(M2 + 1t„) + 2gg(lt, + 1t,j)+ 

- [9-70H914](it, + u„) + [0-6476676](it, + it„) 

+ [0-7984931](it, + u^) + [0-8670626]«i4. 

79. —Ohebyshefs Formulae. —If, instead of laying down the 

condition in formula (1) of § 76 that the intervals ftj — h^, — Aj, etc., 

are to be equal, we lay down the condition that the coefficients 
H„ Hi, . . , H„ are to be equal to each other, we obtain a set of 
formulae first given by Chebyshef,* which have found acceptance chiefly 
with naval architects. They have a certain advantage when the ordinates 
are experimental data liable to unknown errors; for if wo have a 
number of quantities which are equally liable to be alfected with error, 
and if a linear function of tliese quantities is formed, the sum of the 
coefiicients in this function being fixed, then the probable error of the 
function is least when the coefiicients are all equal. 

Chebyshefs general, formula is that if f{x) is a function whose 
{n + l)th differences are negligible, then 

f(x)dx:=^{/(xj)+/(xg) + . . .+/(*„)}, 

where 0 ;^ x^ . , ,, Xn are the roots of a certain polynomial, which is, in 
fact, the polynomial part of the expansion of 

n n n 

jpn^“2.Sa^-4.6a^"'6.7.a5«“‘ * ’ 

in descending powers of x. 

Thus, when f(x) is a function whose sixth differences are negligible, 
we have 


1 "^ y(*)<fo==|{/(ai)+/:®,)+A<^+/«)4+/(a^}, 


* Joum, de math, (2), 19 (1874), p. 19: Assoc, Frang, 3 (Lyon, 1873), p. 69. 
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where x^y • • •> *5 the roots of the equation 

so that - iCj = 0*832437 = 

-a;a = 0*374542 = a; 4 , 

353 = 0 . 

Chebyshefs forniiilae were suggested by a formula due to Bronwin,* 
namely 

L 

which is rigorously true Avhen f{x) is a function whose 2 wth differences 
are zero. Thus if /(sc) is a function whose sixth differences are negligible, 
we have 

80. Gauss’s Formula of Numerical Integration. —Suppose 
now that instead of prescribing the numbers hy, . . of 
§ 76 by the condition that they are to be at equal intervals 
apart, we determine them by the condition that the formula is to 
have the highest possible degree of accuracy; since there are at 
our disposal (2» + 2) constants /to, Aj, . , ,, A„, Ho, Hj, . . H„, 

we can choose them so as to make the formula rigorously true 
when f{x) is any polynomial of degree (2n + l). Denoting 
the product (aj-Ao)(»-Ai) . . . (»-A„) as before by r(®), we 
must therefore have in particular 

j /(«)(/«:=J (f>(x)F(x)(/x = 0, 

where ^(x) denotes any polynomial of degree less than (n +1). 
Now it is a known property f of the Legendre polynomial 
P„+i(a:) that 

f <f>(x)P„+i(x)dx = 0, 

■'-1 

so long as <f>(x) is any polynomial of degree less than (n +1). 
We see, therefore, that the stated conditions are satisfied 
by taking 

F(x) =P„+i(®), 

* Phil. Mag, 34 (1849), p. 262. 
t Whittaker and Watson, Modern Analysis, § 15-14. 
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so the numbers Ao, Ai, . . A„ are the roots of the Legendre 
polynomial of degree (n + 1).* The H’s are then given by 
formula (3) of § 76. 

In the simplest oases we obtain the following values for 
the h’s and H’s; 


n = l, ^0=-^. = Ho = l, Hi = l. 

« = 2. Ai = 0. Aa = -J. Ho = |, Hi = | H2 = |. 

Jj 2 — 2 — ^ I ^ Ji 2_7i iZ® 

71 — u, flQ — ^'3 — ij '^1 — '^''2 ~~ tj ^5* 

„ „ 18-^30 ^ „ 18 +a/30 

llo = ±l 3 = -2g Jll = ±l 2 =--. 


36 


Converting to decimals and transforming the variable so that 
the range of integration is from 0 to 1, we have 

f tP{x)dx = Aox/^{Xq) + Ai^pixi) +. . . + An^(a?n), 

Jo 

where Xr and are the transformed values of hr and H, 
respectively, and in fact 


71 = 1 

iCo = 0-211324 87 
»! = 0-788675 13 

Ao = Ai = 0-5 

71 = 2 

av, = 0-112701 67 

Ao = A 2 = 


xi = 0-5 

a^ = 0-887298 33 

II 

71 = 3 

a;o = 0-069431 84 

Ao = As = 0-173927 4 


»! = 0-330009 48 
*2 = 0-669990 52 

3^ = 0-930568 16 

Ai = As = 0-326072 6 

71 = 4 

*0 = 0*046910 08 

Ao = A4 = 0*118463 4 


*1 = 0*230765 34 

Ai = As = 0-239314 3 


*2 = 0-5 

*4 = 0*769234 66 
*1 = 0-953089 92 

A2 = 0-284444 4 


• Gauss, “Methodus nova integralium valores per approx, inveniendi,” Gott, 
Comm. IIL (1814)= JVerke, 8, p. 163. 
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Ex. l.—^Find hy Gauss's method, with n = 4, the value of 

i: 




Writing 


//V login (1+a:) 

^ ^ ,^.2 = 2*302585 09• 


l+ic2 


we may arrange the working in the form 



a-„. 


1 + .'C 

1-046910 08 

1-230765 34 

JogioCJ +*) 

0-019909 38 

0-090176 26 

A 

0-118463 4 

0-284444 4 

l+j* 

1-002200 56 

1-053252 64 

login 2-302585 09 

0-362215 69 

0-362215 69 

H'w {Jogio(l+»0} 

8-299057 73 

8-956087 40 

logio A- 

9-073584 20 

9-378968 65 


17-734857 62 

18-696271 74 

Jogio(J+**) 

0-000954 64 

0-022632 56 

H'w A^( '-) 

7-733902 98 

8-673739 18 

H(.-^ 

We thus obtain the values 

0-005418 80 

0-047177 96 


Finally we liave 

f. 


0-005418 80 
= 0-047177 96 
A2^(ig) = 0-092265 82 
A3#^) = 0-085781 36 
■^4V<'(*4) = 0*041654 03. 


= 0-272198 0, 




whereas the correct value of this integi-al — ^ Jog 2^ 0-272198 3. 

Ex. 2 .—Show that a degree of accuracy equal to that of Gaus^s formula 
is obtained hy the use of 

j_, ^ {lyA)}*’ 

the sum being extended over all the roots h of the equation 
Fw+i(*^’) Fji——Oj 

that is, over +1, — 1, and the roots of P./(ic) = 0. 


(Radau.) 
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Miscellaneous Examples on Chapteb VII 

1. Show that if fifth differences are negligible 

i+6n 

/aK^ = 0-28(/a +2/^+6+ 2 /a+i 2 + . . •+/«+en) 

+ l*62(/a+i+/a4.3 + . . .) +0'S®(/rt + 3+/a+9 + * • -)• 

(Shovelton.) 


r 


2. Show that if fifth differences are negligible 

//pdflC = +/a+2 +/a+4 + • • • -^fa+en) 

+ 5(/a+i+/a+3 + . . •+/«+6n-l) + (/fl+3+/a+6 + ' • •+/a4.6»-3)} 


(Shovelton.) 


3. Show that the integral 

I </>(«) (1 — ic)^(l + xydx 

•'-1 


may be represented by the sum 

exactly so long as <li{x) is a polynomial of degree less than 2n, provided 
the abscissae h^, . are the roots of the equation F(a5) == 0, where 

F(x) is defined by the relation 

d” 

(a; - l)^(aj + iy¥{x) = }, 

and the H’s are defined by 


1 


dx. 


(Radau.) 


4. The values of a function f(x) are given by the table 


X, 

fix). 

1-060 

1-25386 

1-000 

1-26996 

1-070 

1-28619 

1-D80 

1-30254 

1-090 

1-31903 

1-100 

1-33566. 


Using the Gregory formida, calculate 
/■i-ioo 
/ 

Jwso 
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5. The values of a function f(x) are given by the table 


Evaluate 


X. 


41 

396 254 161 

42 

406 586 896 

43 

418 161 601 

44 

429 981 696 

45 

442 050 626 

46 

454 371 856 

47 

466 948 881. 


/47 

/ J{x)dic, 
Ui 


6. If/(a) is a function whose fourth difterences are negligible, prove 
that 


J ^ /(2 — J'‘l) + J[2 + 

and, more generally, if f(x) is a function whose 27ith differences are 
negligible, obtain a formula 

f e-^f(x)dx= ^ Akf(xk\ 

Jo 

where xi, a^, . . ., % are the roots of the polynomial 

(A. Berger.) 

7 . If/(a;) is a function whose sixth differences are negligible, prove 
that 

j jr^x)dx^ (-\^|) + m +f 

and more generally, if f(x) is a function whose 2?ith differences are 
negligible, obtain a formula 


/: 


er^J(x)dx=> 2A*/(a;*), 
, *=1 


where Xi, are the roots of the polynomial 

(A. Berger.) 

8 . Show that Hardy’s formula (p. 151) may be derived from Gauss’s 
general formula by taking the nearest integral values for the ordinates. 
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CHAPTER VIII 

NORMAL FREQUENCY DISTRIBUTIONS 

81. Frequency Distributions. —In statistical investigations 
attention is directed to a group of persons or objects, and an 
enumeration is made of the individuals in the group who 
have some particular attribute: for instance, we may con¬ 
sider the group consisting of all the inhabitants of Edinburgh 
and enumerate those who are widowers. In the most important 
and interesting cases the attribute is one which is capable of 
being expressed by a number; thus the attribute might be the 
height of the individual, which is expressible as a number of 
inchea In such cases the whole group may be partitioned 
into classes or sub-groups according to the value of this 
number: thus the whole group of the inhabitants of Edinburgh 
may be arranged in classes according to the number of inches 
in their stature; one class, for example, might consist of persons 
whose height is between 68 J inches and 69 J inches. Now let 
Xi be the number of inches in the height of the shortest person 
in the city, and let the number of persons whose height is 
between and Xi + i inches be yi; let + l be denoted 

by X 2 , and let the number of persons whose height is between 
X 2 -i and «2 + J be y 2 > let a ?2 +1 be denoted by xs, and let the 
number of persons whose height is between Xs-^ and xs -hi be 
denoted by ya, and so on. Then we may regard xi, X 2 , xs, • • • 
as successive values of an argument and yi, ya, ysi . . • as the 
corresponding values of a function of this argument. The 
table of values thus obtained specifies the distribution of the 
heights among the inhabitants of Edinburgh: such a distribu¬ 
tion is called a, frequency distribution. 

164 
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As ail example of a frequency distribution, we may take the follow¬ 
ing results of measurements of the chests of 6732 soldiers in Scottish 
regiments.* 

Chest measure in inches 33 34 35 36 37 38 39 40 

Number of men , .3 19 81 189 409 753 1062 1082 

Chest measure in inches 41 42 43 44 45 46 47 48 

Number of men . . 935 646 313 168 50 18 3 1 

The type of frequency distribution which is most familiar 
to the worker in experimental science is the distribution of 
the measures obtained by repeated measurements of the same 
observed quantity. Let the true measure of an observed 
quantity be a. Since measurements made of this quantity 
by the same or different observers are aflected by errors of 
observation, the measures actually obtained will not all be 

equal to a, but will form a group of measures ^ 2 , « 3 , a^, . . . 

not differing greatly from a. The practical problem is to obtain 
the best possible estimate for a when we know the measures 

02 , Os, . . and also to estimate the error to which this 
value is liable. In order to solve these problems, we must 
study the type of frequency distribution to which the group 
czi, ^ 2 , 03 , .. . belongs. 

82. Continuous Frequency Distributions. —In the above 
numerical example we have grouped together in a single class 
all the 753 men whose chest measure is between 37^ and 
38J inches. Supposing that full information regarding the 
individual measures is at our disposal, we might have divided 
this class into two classes, one consisting of men whose chest 
measure is between 37^ and 38 inches, and the other whose 
chest measure is between 38 and 38^ inches; and in this way 
we might subdivide each of the original classes, thereby 
evidently doubling the total number of classes and so producing 
a more detailed statement regarding the chest measures of the 
men. Further, we might divide each of the new classes into 
two, thus quadrupling the original number of classes. But 
evidently if we attempt to proceed very far in this direction 
we shall meet with practical difficulties: thus no statistician 
would think of trying to arrange the men in classes each of 

* Edinburgh Medical Journal, IS (1817), p. 260. 
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which comprehends a variation of only one-thousandth of an 
inch in height, partly because the measures cannot be relied 
on to such great accuracy, and partly because the numbers of 
men in the classes would become small and irregular, and 
would cease to present to the mind a clear picture of the 
frequency distribution. 

For theoretical purposes, however, we may disregard these 
practical difficulties and consider an ideal case in which the 
measures are supposed perfectly accurate, and the number of 
individuals in the whole group is supposed very large, so that, 
however narrow we make the qualification for membership of 
a class, there will always be enough members in it to make 
the sequence of the numbers of members of classes a regular 
sequence. Now consider the class constituted of individuals 
whose measure is between x and x + e, where € is a small 
number. The number of members in this class will evidently 
be approximately proportional to c, since doubling the range 
of qualification for the class would approximately double the 
membership; and it will also be proportional to the number 
N of individuals in the whole group, since, if twice as many 
soldiers were measured, the number of soldiers in each class 
would be approximately doubled. Let us then denote the 
number of individuals whose measure is between x and x + € by 
Ncy; then the number y depends on x, and in fact the way 
in which y depends on x specifies completely the frequency 
distribution. We shall often express the dependence of y on x 
by writing <jE>(x) for y, and we shall use the differential notation, 
writing dx for €, so that in a group of N individuals the number 
whose measure is between x and x + cto is denoted by N</>(x)rfx. 

Expressing the same idea in other words, we may say that 
<^(x)c?x represents the probability that an individual chosen at 
random in the whole group has a measure between x and 
x + rfx; and hence if the frequency curve y-<l>{x) be drawn, 
the measure of any individual is equally likely to be the 
abscissa of any point taken at random within the area bounded 
by the curve y = <l>{x) and the axis of x. 

The earliest mathematical discussion of a frequency distribution seems 
to have been that of Simpson (1756) in connection with the Theory of 
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Errors of Observation. Simpson assumed that the probability of a positive 
error lying between x and x + dx was </>(a;)da;, where ^{x) = - mx + c, 
m and e being positive constants: the probability of a negati ve error 
was assumed equal to the probability of a positive error of the same 
amount, and errors of greater magnitude than c/m were assumed to be 
impossible. The frequency curve for the errors is therefore, in this case, 
two sides of an isosceles triangle, together with the prolongations of the 
base outside the triangle. 

83. Basis of the Theory of Frequency Distributions.— 

We shall now approach the discussion of frequency curves from 
the theoretical side. The considerations on which the theoretical 
investigation is based were first put forward by Laplace, and 
may be described as follows. 

Consider the frequency distribution of (say) the chest 
measures of the soldiers (§ 81). If we fix our attention on an 
individual soldier, we may say that a great many different 
factors have contributed to make his chest measure what it 
actually is. For, owing to heredity, it will have been influenced 
by the chest measures of Ins father and mother; but the chest 
measure of his father will in turn have been influenced by 
those of his father and mother, and so on, so that ultimately 
' the chest measure of the individual soldier we are considering 
may be regarded as influenced by the chest measures of a very 
large number of individuals of a remote generation, each of 
them singly making only a very small contribution to the total 
effect. Moreover, other factors will enter: for example, his 
nourishment and exercise at different ages of life. Thus the 
deviation of the actual chest measure of an individual from the 
average may he regarded as the sum of a very great numher of 
very small deviations {positive or negative) due to the separate 
factors in the heredity and environment of the individual. These 
multitudinous small component deviations will be assumed to 
be independent of each other, in the sense in which the word 
" independent ” is used in the Theory of Probability. 

84. Galton’s Quincunx. —An interesting piece of apparatus 
was devised by Galton* to illustrate the formation of a 
frequency distribution from the joint effect of a large number 
of small and independent deviations. 

* Natural Inheritance^ p. 63 ; for experimental results cf. 0. Gruber, Zeits. 
fUr Math. u. Phys. 56 (1908), p. 322. * 
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Into a board inclined to the horizontal about a thousand 
pins are driven, disposed in the fashion known to fruit growers 
as the quincunx, i,e, so that every pin forms equilateral triangles 
with its nearest neighbours. At the top of the board is a 
funnel, into which small shot is poured. The shot in descend¬ 
ing strikes the pins in the successive rows, each piece being 
deviated to right or to left at every encounter with a pin. At 
the bottom of the board are about thirty compartments into 
which the shot ultimately falls. It is found that the middle 
compartment receives most shot, and that the falling-oft’ in the 
amount of shot received by the compartments, as we proceed 
outwards from the middle compartment, resembles closely the 
falling ofif in the number of men having corresponding devia¬ 
tions from the average chest measures (§ 81), or the falling off 
in the number of measurements which have corresponding devia¬ 
tions from the average of the measures of an observed quantity. 
In fact, the curve formed by the outline of the heap of shot is 
a frequency curve of a commonly occurring type. 

85. The Probability of a Linear Function of Devia¬ 
tions.* —We shall now proceed to the analytical discussion of 
frequency distributions. We start from the fact that a measured 
quantity, such as the chest measure of a soldier, is liable to 
vary from one individual to another, or, in other words, to 
exhibit a deviation from its average value; and we shall suppose 
that this deviation is the total effect of a vast number of very 
small deviations, due to causes which operate independently of 
each other: denoting the small deviations by Cj, Cg, . . €», we 

shall suppose that the total deviation is their sum, or, more gener¬ 
ally, is a linear function of them, say, AjCj + AgCg +. . . + 

We want to find the probability that in the case of a given 
individual soldier this total deviation shall have a value 
between (say) and where and are given numbers. 

In the present section we shall not require to make use of 
the assumptions that the deviations Cg, . . €» are very 

small or very numerous. 

Let I 4ir{^)dx be the probability that the ?’th deviation Cy has 
Ja 

* Lajj^lace, Thiorie anal, des proh, , livro II. chap. iv.; Poisson, Connaissance 
des temps (1827), p. 273. 
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a value between a and 6, so that is the probability that 

it has a value between x and x + dx. Now the probability of the 
concurrence of any number of independent events is the product 
of the probabilities of the events happening separately; and 
hence the probability that the first constituent deviation has a 
value between and while the second has a value 

between Cg and and so on, is 

<Ai(€i)</>2(€2) • • • M^n)d€id€2 . . . dCn, (1) 

and the probability that + . . . + has a value between 
and is therefore the integral of the expression (1), taken 
over the field of integration for which +. . . + 

lies between u\ and w^. 

Now by Fourier’s Integral Theorem * the expression 

Z7ry_Qo Jwi 

has the value unity when x lies between and and has the 
value zero when x does not lie between these numbers. So 
instead of integrating the expression (1) over the field for which 
+ first multiply the 

' expression (1) by 

1 r r* gi»(T-A,n-V,-... 

27rJ«.oo Jwi 

and then integrate the resulting expression over the field of all 
values of Cg, ..The probability that + X^e^ +... + An€„ 

has a value between and is therefore 

r ... r . . . 

y - 00 y - CO 

</>w(€n)rf€jrf€2 . . . d€ndTd6. 

* The theorem in question is that under certain conditions (for which cf. 
Whittaker and Watson, Modern Analysis, § 9*7), the integral 


1 r* 

2^-00 Jwi' 


rW2 


-f do f cos {^(a; - t)}/(t)(/t 


or aW doj 

•Loo *'«»! 

has the value J{x) when x lies betAveen w’j and icj, and has the value zero when 
X does not lie between those numbers. 

t This device is due to Cauchy, Comptes B.ei\dus, 137 (1853), pp. 109, 264, 


334 . 
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Writing 1V(^) for e~^<l>r{x)dx, the required probabiKty ia 
y - 00 

therefore 

^/°1C • • ■ f^nMdrde, 

or, performing the integration with respect to t, the probability 
that + AgCg +. . . + kn€n Uos between x-e and ic + c is 

TtJ «oo ^ 

where Q{6) is written for and 

therefore the probability that +. . . + A»€n lies between x 
and a? + da; is <l>(x)dx, where 

^{x)^^[ji{ey^de. ( 2 ) 

From the last equation we have, by Fourier’s Integral 
Theorem, 

Q(0)= r e-*^<f,(a;)dv, (3) 

J -00 

so ^l(x) bears to r/)(x) the same relation that 12,.(a?) bears to <f>r(x). 

It appears, therefore, that when deviations are added to form an 
aggregate deviation, the corresponding functions are multiplied 

together to form the function The underlying reason for this is 

tliat when the S2’s are multiplied together, the arguments of the 
exponentials add together in precisely the same way as the deviations. 

Now write .s* fori a^<fir{x)dx, so that is (in the language 
y - 00 

of the Theory of Probability) the expectation of the Ath power 
of the rth deviation. Then by the definition of 12^(0) we have 

= 1 - iBs^ - ^^2 + +. . . 

Now let the logarithm of the series on the right, when 
expanded in ascending powers of 0, be denoted by 

0^ i0^ 

log fl^£l) = - iepi - + yjPs + J|P« - . . . (4) 

80 that 



«1 1 | ^3 = 

*1 

1 

0 


^2 1 * 

»a 

*1 

1 



% 

*3 

23 ^ 
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The quantity is called * the seminmHant of order k By 
(4) the fundamental formula 

m = - • • «n(An^), 

or log n{e )=log n^{kje) + log Q^{k^e) + . . . + log ^in{K0), 

may now be expressed by the simple statement that the kth 
seminmriant of a sum of deviations €^ + €^ + . . . + is the sum 
of the kth seminvaHants of the individual deviations, with the 
further remark that the Ath seminvariant of Xe is A* times the 
seminvariant of €. 

86. Approximation to the Frequency Function. —We shall 
now derive from the result of the last section a formula of 
great practical importance by introducing the assumptions 
(which have not as yet been used) that the constituent devia¬ 
tions €i, €21 are very numerous and are individually very 

small. It is easily seen that by shifting the zero-point from 
which the deviation €r is measured wo can secure the vanishing 
of its first seminvariant: we shall suppose this effected for 
each of the constituent deviations Cj, 

» Let 0, ^ 31 , ... be the seminvariants for the devia¬ 
tion € 1 , let 0, i? 22 > -P 42 » • • • be the seminvariants for the 

deviation € 3 , and so on; then, as we have seen, the semin¬ 
variants of the aggregate deviation c are Pp Pg, . . ., where 

1 \ = 0 , 

Pg = ^1^21 ^2^22 ^^23 + • • • 

P8=Ai*iP31 +Vi’32 + ^% + - • • 

We shall suppose that Pg is finite; and we shall further 
suppose that the constituent deviations are very numerous and 
are approximately of the same order of magnitude (or at any 
rate that a very large number of them are of the same order of 
magnitude and the rest are smaller than these), so the condition 
that Pg is finite implies that each of the quantities Xr^ 2 r is a 
finite quantity multiplied by Ijn, and therefore \r>JP 2 r is of 
the order of n‘K This being so, K^psr will be of the order of 
and P 3 will be of the order of p 2 rP 2 r~^'^~K aud so 
will—at any rate in a large class of cases—be small compared 

* Hhioie, Almindelig Jagttagelseslaere, 1897: Theory of Ohseroa- 

tioMf London (0. k E. Layton), 1903. 



172 


THE CALCULUS OF OBSEKVATIONS 


with Pj, on account of the factor wi Similarly, P* will be 
small compared with Pg, and so on. Thus for the aggregate 
deviation we have 


log fi( 0 ) = 




i^p 4- ^p - 

31^3 + 41^4 • • 


where Pg, P 3 , P 4 , . . . are a rapidly decreasing sequence of 
quantities, and indeed in a large class of cases P 3 , P 4 , . . . are 
negligible compared with Pg. The probability that the aggre¬ 
gate deviation lies between x and x + dx is therefore <l>(x)dx 
where, by equation ( 2 ) of the last section, 






’ do. 


Writing 


e*! • • • = 1 + A(i^)8 + B{i0)* + C(iO)^ +. . ., 

where A, B,. . . are in a large class of cases negligible compared 
with Pg (as follows from what has been said above), we have 

Now “j e 2 ^ 2 , 

and therefore 


In the large class of cases to which we have referred, when 
A, B, 0, . . . are negligible, this becomes 


<^(x) = 


(2-P2)* 


1 

J- 2Pa 


(B) 


The formula (B) is due to Laplace, and the more general 
form (A) to various later writers.* These formulae specify 
the nature of a frequency distribution in which the deviations 

* The first of the additional terms was found by Poisson, and the rest by 
J. P. Oram, Oni Jiaekl’eiidviHinffer (1879); Thiele, Forelaesninger over 
Almindtlig lagttageUeslaere (1889); Elemerdwr lagttagelseslaere (1897); Edge- 
worth, Phil, Mag, 41 (1896), p. 90; Bruns, Ast, Nach, 148 (1897) col. 329; 
Charlier, Jrkiv for Mcdh, 2 (1905), No. 20. 
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from the average measure are due to the joint effect of a very 
large number of independent causes, each of which singly has 
only a very slight influence; the deviation of an individual’s 
chest measure from the average, for instance, being thought of 
as affected by inheritance from a very large number of remote 
ancestors. 

87. Normal Frequency Distributions and Skew 
Frequency Distributions. —Formula (B) of the last section 
shows that, at any rate in a large class of cases, when a devia¬ 
tion € is constituted by the summation of a very large number of 
independent deviations the probability that € lies 

between x and x + dx is 


1 


"2P. 


^dx. 


where is independent of x. This is called the normal law of 
deviation. 

Now consider a frequency distribution consisting of N 
individuals (where N is a very large number), and suppose that 
the number of these individuals whose measure is between x 
^nd x + dx is denoted by N<^(a/’)efo, so that is the prob¬ 
ability that an individual taken at random in the whole group 
has a measure between x and x + dx. Let a denote the average 
value of X for all the individuals of the group, so that [x - a) is 
the deviation of the measure of an individual from the average. 
Then on the assumption that the deviation is due to the 
operation of a very large number of independent causes, each 
of which makes only a very small contribution to the total 
deviation, we have seen that the probability of a total devia¬ 
tion between x-a and x-a + dx is (at any rate in a large 
class of cases) 

(27rP2)i® 


where Pg is independent of x. Frequency distributions to 
which this applies are called normal frequency distributions. 
Other distributions, for which the law (B) must be replaced by 
the more general law (A), are generally called skew frequency 
distributions. 
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As the subject of the present chapter is normal distributions, 
we shall say nothing more about skew distributions except 
that formula (A) leads to an expression for a skew frequency 
function as a series of the functions which are called parabolic- 
cylinder fuThctions* or Hermite's functions. In most cases two 
or three terms of the series are adequate to represent the 
function with sufficient accuracy. 


As an illustration of the way in which a great iiiiiuber of frequency 
distributions in nature conform to the normal law, we give the following 
data relating to 1000 observations made at Qreenwich, of the Bight 
Ascension of Polaris. Let os denote the deviation of one such measure 
from a value near the mean of all the measures, expressed as usual in 
seconds of time; let y denote the number of measures for which the 
amount of the deviation is x ; let y^ denote a value calculated from the 
theoretical equation of a normal frequency distribution, 





where a= -0*06 and /i = 0*6. Then we have the following results : 


X. 

y- 

2 /'. 

-3-5 

2 

4 

-3-0 

12 

10 

-2-5 

25 

22 

-2-0 

43 

46 

-1-5 

74 

82 

-1-0 

126 

121 

- 0-5 

150 

152 

0 

168 

163 

0-6 

148 

147 

1-0 

129 

112 

1-5 

78 

72 

2-0 

33 

40 

2-5 

10 

19 

3-0 

2 

10 


The agreement of y and y' is, generally speaking, good. 

Ex .—r measurements are made of a certain quantity^ each measure 
"being liable to an error which may have any value between € and — e, all 
such values being equally likely. The sum of the r measures is denoted by s. 
Show that when e tends to zero and r increases indefinitely in such a way 
that ejr tends to a finite value k, then the probability that s lies between x 
and x + dx tends to 



* For an account of these functions cf, Whittaker and Watson, Modem 
Analysis^ § 16*7. 
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88. The Reproductive Property of the Normal Law 
of Frequency. —Consider a frequency distribution which is 
normal, so that if e denotes the deviation of an individual 
measure fix)m the average, then the probability that € has a 


value between x and x + elm ia 


^writing for 


Jir 


The constant h was called by Gauss the modulus of precision. 
The function fl(0) is, by equation (3) of § 85, 


n(0) = f e-^<j>(x)dx 

•' — CO 

-jLre-^- 

~ jJ-J 




02 

“TO 


SO 


log 


The only seniinvariant is therefore of order two and has the 

value From the additive property of seminvariants it 

follows that a deviation which is formed by the summation of 
any number of partial deviations, each of which obeys the 
normal law, has all its seminvariants zero except the second; 
and therefore this aggregate deviation itself obeys the normal 
law. This is the reprodvxtive property or groxip property of the 
normal law of frequency; an aggregate deviation, formed hy 
the summation of any number of deviations which obey the 
normal law, itself obeys the normal law, 

89. The Modulus of Precision of a Compound Devia¬ 
tion. —Let a deviation € be given as a linear combination 
Vi + V 2 + • • • + of a number of deviations 
each of which obeys the normal law, so that the probability 
that €,. lies between x and a? + efe is 

^-'^dx, 

Jir 

where A, is the modulus of precision for > 
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Then hj the last section, the seininvariant for €r is and 

therefore the seminvariant for +. . . + which is also 
of order two, is 


vv 

2 W 


2^7, 2 ^ • 

1 'h 


• + 


.V' 

K 


)■ 


Now, as we have seen, € also obeys the normal law of 
frequency; let the modulus of precision for € be H, so the 
probability that € lies between x and x + Ac is 


^/7r 

then the seminvariant for € is 




2H2* 


Equating this to the value 


obtained above, we have 




A2 


.+ 




This important formula gives the modulus of precision of a 
compound deviation in terms of the moduli of precision of its 
constituent deviations, when these are of the normal type. 

90. The Frequency Distribution of Tosses of a Coin.— 
The earliest example of a normal distribution of frequency was 
discovered by De Moivre in 1756 * in considering the following 
problem: A coin is tossed N times, where N is a very large 
number (which for convenience we suppose even and equal 
to 2n)\ to find the probability of exactly (^N-^) heads and 
(JN +p) tails. The probalulity of exactly n heads and n tails 

is, by an elementary formula of probability, Ee- 

placing the factorials by Stirling’s approximate value (§ 70), 

namely, z\^er^zr^^^27^y, this becomes (irn)-* or The 

probability of exactly {n-p) heads and (w+jf?) tails is 

sL 7-rrr- r.'j denoting the ratio of this to the former 

2^ {n-p)\{n+p)V ® 

probability by E, we have therefore 

^_ nlnA _ 

{n-p)]{n^p)V 

' * Doctrine of Chances, 3rd ed. (1760), p. 243. 
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BO log E = 2 log n\ - log [n -p)\- log {n +p )!. 

Eeplacing the factorials by Stirling’s approximation, we have 

log E = {2n + 1 ) log n-[n -p +log (n-p) 

- [n+p + \) log {n+p) 

= + 1 ) log (l + 1 ) log (l + 1 )} 

= --^- + 

n ’ 

2 «® 

Combining these results, we see that the probalility of exactly 

/ 2 \4 

(JN -p) heads and +p) tails is approximately ^ 

whence the prohahility that the number of heads will he between 

/This is evidently a case of a normal frequency distribution. 

Ex, 1 .—Show that the prohahility that the number of heads will he 

between JN + and JN — is 0-6827. 


Ex, 2 .—A coin is tossed 1000 times. Show that an absolute majority 
of the 2^®®® possible sequences gives the difference between the number of 
heads and number of tails less than tweniy-iwo, 

91. An Illustration of the Non - Universality of the 
Normal Law. —The following example* shows that a great 
number of causes, each producing a very small deviation, do 
not always by their collective operation give rise to a frequency 
distribution of the normal type. 

Suppose the law of frequency for each of the small con¬ 
stituent deviations Cg, € 3 , . . . is so 

tf>{x) = \e~^ when x is positive, 


(d 811 ) 


* Hausdorff, Leipzig Ber, 68 (1901), p. 162. 


7 
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and <l>(x) = when a: is negative. 

In this case we have (§ 85) for each of the constituent 
deviations 

= j JJ + ^j e-^+*dx 

1 

1 + e^ 

Let the compound deviation be defined to be e, where 

2/11 \ 

€ = -^e, + 3e, + g«3 + . • ■)■ 

Then (§ 85) the probability that « has a value between x and 
x + dxia <f>{x)dx, where 

Q{e)e^^d0, 

and ’ 

111 
= 2*^' ^ * ’ • 

IT® 

_ 1_ 

cosh ff 

>ru f ^/ ^ 1 f” 

Therefore 4>{x) = 

This integral may readily be evaluated by integrating p — -j j 

in the plane of the complex variable 0 round a rectangle of 
height V and indefinitely great breadth, one of whose sides is 
the real axis of d. The result is 

Thus the prohahUity that the deviation c has a value "between x 
and a? + (fo ts This is clearly dififerent from the 

normal law. 
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Theory asserts, and observation confirms the assertion, that the 
normal law is to be expected in a very great number of frequency 
distributions, but not in alL This state of things led in the past 
to much confusion of thought regarding the validity of the normal 
law, which was wittily referred to in Lippmann’s remark to Poincar^:* 
“Everybody believes in the exponential law of errors: the experi¬ 
menters, because they think it can be proved by mathematics; and 
the mathematicians, because they believe it has been established by 
obsei-vation,” 


92. The Error Function. —Having now established the 
theory of normal frequency distributions on a theoretical basis, 
we proceed to investigate the properties of these distributions. 
Denoting the modulus of precision by h, the probability that a 
deviation lies between x and ~ a? is 

or 

2 r 

where ^ix) = — 7 - / e^^dx. 


On account of its importance in the Theory of Errors of 
Observation, ^{x) is called the Error Function. 

If the probability that a deviation lies between x and - a; is 
given, ^(hx) is known, and therefore hx is determinate: so 
the deviation x corresponding to this probability decreases 
as h increases; that is to say, in the case when the devia¬ 
tions are the errors occurring in a set of measurements of 
a quantity, the precision of the measurements increases as 
h increases. This is the reason why h is called the modulus of 
precision. 

We must now see how the function <&(«:) can be computed-f 

1®. First we have 




• Poincar^, Calcul des prob. p. 149. 

t For the analytical relations of the Error Function, cf. Whittaker and 
Watson, Modem Analysis, §16.2. ^ 
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and, integrating this, 



a? 



This series converges for all values of x. 
For example, when £c = 0'5, wc have 

X ^ 

I“m'^2T5“377''’4T9~' ■ 

1111 1 
"2 24 320 6376 110692 ' 


= 0-50000-0-04167 
0-00313 0-00019 

0-00001 

= 0-46128, 

BO ^(x) = 0-46128 X 1-12838 

= 0-52060. 


2®. Next, write 


Jo 


Differentiating, we have 


or 




Now y evidently begins with a term x. So substitute 
y‘=x + 03? + h?-^cx' . .in the last equation and equate co¬ 
efficients of powers of x\ therefore we obtain 3a-2 = 0, 
66 - 2a = 0, 7c - 26 = 0, etc., and therefore 

2 , 2 * 2 * , 

Therefore 

This series converges for all values of x. 
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Tor example, when a; = 0-5, we have 
#(*)=. M2838xe-o-2»x 5 


xfi+i-l 

I 3 2 

= 1-12838 X 0-778801 xix I 


3.5 


22 ■*'3.6.7 


1 


1 

■ ^ + 


3.5X9 2*'*'' 
1 


^■*■6 ■''60 ■*'840 ■*■'15120 


■} 


= 0-56419 X 0-778801 x 1-000000 
0-166667 
0-016667 
0-001190 
0-000066 
0-000003 

= 0-56419 X 0-778801 x 1-184693 
= 0-62060. 

3®. Next, since ^^ = 2 e-^dx, we have 
.'0 

Jv - (aj) = 2 [ e-’^dx 
= f e-vy-idy. 


where a^=y, 


f 

• y 

= e-yy-i + ^y-ie-u]* +y g-vy-^dy 


1.3 


= e-yy-\- ■^-yy-\ + ~ e'S'y 


• 8 _. 


and therefore 

This is an asymptotic expansion* 
computation when x is large. 


1.3 


(2ir*) (2a^)'2 


1.3.5 
which 


IS 


1 

convenient 


for 


A table of values *!• is 

X, 4>(2I!). X, 

*(x). 

X, 

^(x). 

X, 

4*(x). 

0*0 

0-0000 

0-6 

0-6039 

1-3 

0-9340 

2-0 

0-9953 

0-1 

0-1125 

0-7 

0-6778 

1-4 

0-9523 

2-1 

0-9970 

0-2 

0-2227 

0-8 

0-7421 

1-5 

0-9661 

2-2 

0-9981 

0*3 

0-3286 

0-9 

0-7969 

1-6 

0-9763 

2 3 

0-9989 

0*4 

0-4284 

1-0 

0-8427 

1-7 

0-9838 

2-4 

0-9993 

0-477 

0-6001 

1-1 

0-8802 

1-8 

0-9891 

2-5 

0-9996 

0-5 

0-5206 

1-2 

0-9103 

1-9 

0-9928 



* Cf. Whittaker and Watson. Modern Analysis^ 

, ch. viii. 




*1* More extended tables will be found in a memoir by J. Burgess, Trails, Roy, 
Soe. Edin, 89 (1898), p. 267. Cf. also W. F. Sheppard, Biometrika, ii. (19p3), p. 174. 
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Evi, 1 .—Show that the probability that the deviation ie greater than a 
is slightly less than 0*001 when ^(hx) = 0*9981 or hxs=2^2. 

Ex, 2,—Obtain the formula 

g-aja 1 

1 - - 7 - 

X^TT ^ ^ y 


1 +- 


1 +- 


1 + 


3y 


1 +. 


where y=— 2 . 

93. Means connected with Normal Distributions,—^With 
the normal law of deviation 

the arithmetic mean of the wth powers of the absolute values 
of the deviations is 

JwJq 

Writing this becomes 

1 ^00 n-l 


or 


(t) 


./irA" 

In paxticukr, writing <r for “w® have 
Arithmetic mean of the absolute values of— 


First powers of the deviations 

1 

hjv 

Second „ 

99 99 

1 

2h?~ 

Third 

99 99 

_ 1_ 

h^^ir 

Fourth „ 

99 99 

3 

“4A<~ 

Fifth „ 

99 99 

II 


272 


a*. 


= 3a^, 

^ * Gf. Whittaker and Watson, Modem Analysis, § 12.2. 
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From the above we see that the square of the mean of the 
absolute demotions^ divided by the mean of their squares^ has the 

value —. 

TT 


1.— l^how that in the curve y= —r the •product of the ahedsea 

and the eubtangent is constant. 

Ex. 2 .—Show that in the curve y= ^he abscissae of the points 

of inflexion are ± a-. 

94. Parameters connected with a Normal Frequency 
Distribution. 

1®. Let the curve which represents graphically a normal 
frequency distribution be written 

«=4- 

Then the arithmetic mean of all the observations 

= r 4 

y-oo 

2®. The quantity <r which has been introduced in the last 
section is (as we have seen) the square root of the arithmetic 
mean of the squares of the deviations^ and has the value 

1 

V2‘ 

(T is called the standard deviation, or the quadratic mean devia¬ 
tion or the error of mean square.* 

In terms of <r, the normal law is that the probability of a 
deviation between x and a; + is 


r^(27r) 


e ^^^dx. 


By the formula which gives the measure of precision of a 
linear function of deviations we see that the square of the 
standard deviation of a sum of quantities is the sum of the 
squares of the standard deviations of the separate quantities. 

* It was called by Gauss the rnewn error. 
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Hence the standard deviation of the arithmetic mean of n 
t^iUUies is n~i x the standard deviation of one of the quariiities. 

3°. The arithmetic mean of the absolute values of the 
deviations is called the mean ahsolute deviation.* Denoting it 
by 1 ), we have (as we have seen in § 93) 

1 /2 

^ ~h^w \ ir 

The arithmetic mean of all the observations that are greater 
than the mean a is a + • <r, so, denoting this by A, we have 

(A-a) = 1-253 (A-a). 


This formula is convenient when the observations are given in 
the rough, not arranged in order of magnitude; for then, after 
adding them all and dividing by their number so as to obtain 
a, we have merely to add all that are greater than a and divide 


by 


their number in order to obtain a + 



4®. The prohahle error or quartile I is defined to be such that 
the chances are even whether the deviation exceeds it in 
absolute magnitude or is less than it. So if Q denotes the 
quartile, we have 


2 .JttJq ^ 


-ftSAa 





-t* 


dt, 


which gives 


QA = 0-476936 = p (say). 


or 


^ = 0-67449 ^roughly 


so cr=1.4826Q, Q = 0.67449(r. 

Q is connected with rj by the equations 

7y = 1.1829Q, Q = 0-84536 i7. 

Q is evidently that deviation which stands in the middle of the 
sequence when the deviations are arranged in order of absolute 

* It was called by Laplace the mean error, 
t The name is due to Galton. 
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magnitude. It therefore furnishes a means of determining the 
parameters A or <r of the curve as follows: 

Let there be n measurements. Take n equidistant points 
along a base-line, and at these points erect ordinates pro¬ 
portional respectively to the measures arranged in order of 
magnitude. We thus obtain a curve as in the diagram: 


A* 



Divide the base AB into four equal parts by points of division 
C, D, E. The ordinate at D is the mean value a of the 
measures. The ordinate at E is such that there are as many 
measures greater than it as lie between it and a. The difference 
^'DD' ~ EE' or CC' - DD' is the Quartile Q. 

More generally, we can determine the parameter A or o* of 
a curve by finding a measure such that (say) p per cent of 
all the measures fall within the interval between the mean a 
and Xy In order to obtain o-, we have only to multiply - a) 
by a known numerical factor depending on y. 

The degree of accuracy of this and the other methods of 
determining o- will be considered later (§ 103). 

For expressing results in terms of probable error, the following 
form of the table of the function ^(x) is useful: * 

0*5000000 = ^(0*4 769363) = #(/>), where pjh is the probable error 
0*6000000 = ^(0*5951161) = ^(1*247790/)) 

0*7000000 = #(0*7328691) = ^(Vb^GGlSp) 

0*8000000 = #(0*9061939) = #(l*900032/o) 

0*8427008 = #(1) =#(2*096716p) 

0*9000000 = #(1*1630872) = #(2*438664p) 

0*9900000 = #(1*8213864) = #(3*818930/o) 

0*9990000 = #(2*3276754) = #(4*880475p) 

0*9999000 = #(2*7510654) = #(5*768204p) 

1 =#(oo) 

♦ It is due to Gauss, TFerke, 4 , p. 109. » 


(0 811 ) 


7 * 
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We see, therefore, 

The probability that the error exceeds— 

2*438664 times the probable error is 
3*818930 „ „ „ etc. 

Ex,—A number of bodies similar in shape and density differ slightly 
in size, their lengths being grouped about a mean a with standard deviation 
O'. If the weight of a body of length x is cac®, show that the weights are 
grouped about a mean ca® with the standard deviation dcaV. 


95. Determination of the Parameters of a Normal 
Frequency Distribution from a Finite Number of Observa¬ 
tions. —In the preceding section we have shown how to find the 
parameters a and o- (or ly or A or Q) of a normal distribution, 
assuming tacitly that the observations are infinite in number 
so as to furnish continuous distribution. In reality, however, 
the number of observations is finite, and we have to determine 
the best values of the parameters from them. 

Suppose there are n observations giving the values 
a^, . . ., Xn respectively for x. The a priori probability of the 
value x^ is 

1 _(gi~a)» 

2<r* 

^(27r)o' 

and therefore the a priori probability that the observations will 
give the set of values actually observed is 

/ 1 (gi-o)*+(gg~gy+. « .-fCgn-g)* 

The most probable hypothesis regarding a and o* is that 
which makes this quantity a maximum when x^, ojg, . . are 
supposed given. Taking logs, we see that 


n 


logo- + • + 


0n 


or n say, most be a minimum, and therefore ^ - 0, which gives 
0 = (a^ - a) + (asj - a) +. . . + (a!„ - a). 


or 




( 1 ) 
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Moreover, ^ 

0(T 
0 = 

and therefore 


= 0, which gives 

n ^ [x^ - aY + (9*2 - +. . . + {x^ - a)* 

cr (T® * 

n 


( 2 ) 


Thus the formulae (1) and (2) for a and o* are determined 
directly from the theory of Inductive Probability. This is, 
strictly speaking, the only correct method when the number of 
observations is finite. 

Ex. 1 .—Numbers which are given as decimals to some definite number 
of places are usually forced, i.e. the last digit retained is increased by 
unity when the first digit not retained is 5, 6, 7, 8, or 9. Find the 
standard deviation or mean error to which a number is liable^ due to 
abbreviation with forcing. 

Denoting the standard deviation by o*, we have 

^ Sum of squares of all possible omitted “ tails ” 

” Number of these ‘‘ tails ” 



since all tails c between — 0*5 and 0*5 (in units 
of the last digit retained) are equally probable, 


(0.5)®-(-0-5)® 

3 


giving (r = 0*2887. 

Ex. 2.—Hence show that the mean error liable to occur in the sum 
of 1000 numbers, each of which has been abbreviated with forcing, is 
less than 10 units of the last place. 

Ex. 3 .—A quantity is repeatedly measured, the measures being subject 
to errors of observation. Assuming the law of facility of error to be such 
that the probability of a value between x and x + dx is 

where b and h are constants, determine the most probable values of the para¬ 
meters b and k when, n observations give the values x^ . . ., acn? ® i 
the values x^, x^ . . . being arranged in ascending order of magnitude. 

Here the a priori probability that the observations will give this set 
of values is proportional to 

+ + • • • 

2»» s 
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The most proljable hypothesis regarding h and k is that which 
makes this quantity a maximum, when Xn are supposed 

given. 

Taking logs, we see that 

Il = 2n]ogk-k^{\xj^--h\ + \x^'-h\ +, . .+la;,i-6l} 

0JJ 

is a maximum, and therefore ^ Suppose that b lies between Xr and 
Xr+i, Then 
0 

gj{|a;j-6| + |a;2-6[ + . . . + |a;,j-6|} 

0 

= ^{(6-®i) + (6-*2) + - • .+(6-:>-r) + C'V+i-6) + - • . + (»»-6)} 
= (1 + 14*. • . + l)rterms + ( — 1“1. • . — 1 )(h— r)terms9 
011 

and therefore is zero when r = n - r or r = |?i. We see, therefore, that 

the most plausible value of h is that one of the quantities aCg, . . ., x^ 
which stands in the middle of the sequence when they are arranged in order 

0 IT 

of magnitude. This value is called the median. The condition '^ = 0 

then gives for the determination of the most probable value of k the 
equation 

• •+K-6|}. 

96. The Practical Computation of a and o-.—In calcu- 

lating a and <r, when we are given that the measures x^, x^ . . . 
have occurred • • • times respectively, we generally 

find it convenient to subtract some fixed number c from each 
of the in order to have smaller numbers to deal with. 
Write-c = etc. 

Then we form a column of the quantities &» • • • 

then „ „ „ 1/i^v • • * 

then „ „ „ y^^, y^^, . . . 

the value of being obtained by multiplying y^r by fr. 
and sum the column of y’s and the last two columns. We 
then have 
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and ^ = 

2y 

= %^ + 2(c-«)||+(.-«)* 

=M_2(c-«)*+(c_a)*, 

or (2) 

Thus a and <r are determined. 

As a control for the computation of a, we may use the 
equation 

Sum of all positive residuals = Sum of all negative residuals, 
and as a control for the computation of o-, we may use the 
equation 

na^ = 2(aj - 

or na^ = SM^ - 2a2M + a2M = 2M^ - a2M, 


where 2M denotes the sum of the measures and 2M^ denotes 
the sum of the squares of the measures. 

97. Examples of the Computation of a and or. 

Ex, 1 .—The chest measiirements of 10,000 raen are given below, x 
being the measure in inches and y being the number of men who have these 
measures. Find the two constants a and or which specify the frequency 
curve, obtaining the standard deviation firstly from the mean-square of the 
deviations, and secondly from the mean absolute deviation. 


X. 

y- 


y/^ 

yf*. 

33 

6 

-7 

- 42 

294 

34 

35 

-6 

- 210 

1260 

35 

125 

-5 

- 625 

3126 

36 

338 

-4 

- 1352 

5408 

37 

740 

-3 

-2220 

6660 

38 

1303 

-2 

-2606 

5212 

39 

1810 

-1 

- 1810 

1810 

40 

1940 

0 

0 

0 

41 

1640 

1 

1640 

1640 

42 

1120 

2 

2240 

4480 

43 

600 

3 

1800 

5400 

44 

222 

4 

888 

3552 

45 

84 

5 

420 

2100 

46 

30 

6 

180 

1080 

47 

5 

7 

35 

245 

48 

2 

8 

16 

128 

Total 

10000 


-1646 

42d94 
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Therefore 

1 

o 

II 

1646 

10000-»»-8®®« 


and 

2 42394 

~ 10000 ■ 

-(0-1646)*= 4-2123, 


BO 

a 

= 2-052. 


In Older to find 

cr from the 

mean absolute deviation, we 

following table: 




X. 

y- 

o' 

1 


33 

6 

-7 

- 42 

34 

35 

-6 

- 210 

35 

125 

-5 

- 626 

36 

338 

-4 

- 1352 

37 

740 

-3 

-2220 

38 

1303 

-2 

-2606 

39 

1810 

-1 

-1810 

39-668 
(=39-500 to 
39-835) 

Total 

650 

-0-332 

- 216 

6007 


-9081 


Now 


«-cr^? = 


or 


Therefore 


Arithmetic mean of all observations less than a 
=^ = 40 + M 


39-835 


9081 


""V 7r~^® 6007 


<r= 1-649x1 


or 


0- = 2-066. 


Ex, 2.— Find the mean value and standard deviation for the following 
frequency distribution^ y being the number of occurrences of the measure x. 
Trace the normal curve which has this mean value and standard deviation 
and compare it with the original data. 


oc. 

1 

2 

3 

4 

6 

6 

7 

8 

9 

10 

11 

12 

13 

14 

y- 

6 

10 

2 

11 

15 

18 

18 

34 

32 

39 

39 

46 

50 

43 

oc. 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25. 




y- 

38 

26 

34 

28 

22 

17 

11 

12 

11 

4 

4. 





Ex, 3.— Determine the standard deviation from the mean absolute 
deviationan this last example. 
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Ex, 4. —The chest measures of 10,000 men are as follows^ x denoting a 
measure in inches^ and y the number of men who have that measure* 
Determine the standard deviation by finding the place of the quartile. 


x. 

33 

34 35 

36 

37 

38 

39 

40 

41 

42 

y- 

6 

31 141 

322 

732 

1306 

1867 

x882 

1628 

1148 

Ty. 

6 

36 177 

499 

1231 

2636 

4403 

6286 

7913 

9061 

X. 

43 

44 

46 

46 

47 

48. 




y- 

646 

160 

87 

38 

7 

2. 





9706 

9866 ! 

9953 

9991 

9998 

10000. 





It is to be remembered that under “05=38” are grouped all men 
whose measures are from 37^ to 38^ iiichea Tlic sum of all measures 
from the lowest to a; = 38 {i.e, to 38^ inches) is 2636; and the rate of 
increase of men is about 16 per inch at 38so that 2500 will be 
the sum of all to 38*478 inches. 

The Arithmetic mean a of all the measures is found to be 39*834, 
Q = 39-834-38-478 = 1-366, 

BO 0 -= 1-483 xQ = 1-483 X 1-366 = 2-01. 

Let us now find the quartile from the other end of the sequence. 
The sum of all measures from the largest down to a; = 42 (i,e, to 41^ 
inches) is 2087. Adding half the number opposite 41 in the table, 
we see that the number down to 41 inches is 2901, which is 2600 + 401. 

401 

The quartile will therefore be at 41 + a;, where x = = 0*246, 

so that Q = 41*246-39*834= 1*412, 

and 0 *= 1*483 x 1*412 = 2*09. 

The discordance between this and the former value is, of course, due to 
the aljsence of perfect normality and continuity in the distribution. 

The mean of the two values of <r obtained by the quartile method is 
therefore o* = 2*05. This is close to the valuer obtained by the method 
of § 97 and the mean absolute deviation method, which are found to be 
2*05 and 2*06 respectively. 

98. Gomputation of Moments by Summation.—The 
quantities Mq = 2y, Mj = Mj = . . ., which are called 

the moments, may be readily formed by mere addition in the 
following way. 

We use the notation ly (read “ sum of y ”) to denote the 
function whose first difference is the function y, so that if 
A«= y, then n = 2y. The symbol 2 corresponds, in the Calculus 

of Differences, to the symbol j which represents indefinite in¬ 
tegration in the Infinitesimal Calculua Just as a column of 
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fint differences of « can be formed from a column of values of 
n by subtractions, so a column of values of Zy can be formed 
from a col umn of values of y by additions. 

Suppose the set of given values of y is y„ yr+y, yr+j, . . 
corresponding respectively to the values r, {r +1), (r + 2), 
. . (r+7i), of X. Form a table of sums of the function y by 

additions from the foot of the column, thus; 



y- 


2V 

r 

Vr 

yr+y^+i+- • .+yr+« 

... 

r + 1 

yr-\‘\ 

y,+l + . . . + yr+n 

y,+i + 2y,+2 + . . . + nyr+n 

r + 2 

yr+2 

y,+2 + * • • + yr+n 

yf+2 + 2yr.|.8 + . . . + (»-%,+„ 

r+n-2 

yr+n-2 

y»'+ti-s+» • •+yr+« 

yf+n-eH" 2yr+»»-i + 3y+,.,i 

f + 7l-l 

^r+n-l 

yr+n-1 + yr+» 

Vr+n-l + 2yr+n 

r+n 

yr+» 

yr+n 

Vr+n 




Sum= Sj 


The uppermost number on the hy column is evidently Mg, 
the moment of zero order. Let the uppermost number in the 
1y column be denoted by Sg, and the uppermost number in the 
column be denoted by Sj. Then 

Sg=Mg. 

Si = yr+l + 2yr+2 + 3yr+3 + ' • • + ®yr+n 

where Mj=?^,.+ (r + l)yr+i + . . . + (»’ + »)yr+„ is the moment 
of order 1. 

The sum Sg of the column is 

Sg “ yr+1 + 3yr+2 + +. . . H 2 ^yr+» 

= HMg-(2r-l)M, + r(»-l)Mg}, 

where Mg=r*y,+(r+l)*yr+i + . . . + (r + M)*yr+n is the moment 
of order 2. Thus Mg and Mg may be determined from the 
equations 

Mg = Sg + rMg, 

Mg-2Sg + (2r-l)S, + rSMg, 
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and therefore if these relate to a frequency distribution of mean 
a and standard deviation cr, we have 


or 


^ w- 


■jMo 






Mo Mo* 



Mo Mo W’ 



The higher moments may then be found in the same way if 
desired.* 


Let us apply tliis to the computation of a and tr for the chest measures 
of Ex. 1, § 97. 


£C. 

V- 

2y. 


33 

6 

O 

o 

o 

II 

02 

... 

34 

35 

9994 

68354 (=S^) 

35 

125 

9959 

58360 

36 

338 

9834 

48401 

37 

740 

9496 

38667 

38 

1303 

8766 

29071 

39 

1810 

7453 

20315 

40 

1940 

5643 

12862 

41 

1640 

3703 

7219 

42 

1120 

2063 

3516 

43 

600 

943 

1453 

44 

222 

343 

510 

45 

84 

121 

167 

46 

30 

37 

46 

47 

5 

7 

9 

48 

Therefore 

Mo = 

2 

10000 = So, 

2 

2 

82 = 288862 


S, 

a = ^ + r = 6-8354 + 33 = 39-8354, 
and <ra=2|?-|i- f|i? = 57-7704-6-8364-(6-8364)* 


= 4-212 as before, giving (r= 2-052. 


* For a fuller investigation, showing the advantages of central or mean 
sums, cf. a note contributed by G. J. Lidstone to G. F. Hardy’s Construction of 
Tables of Mortality (London, 1909), printed on pp. 124'128 of tliat work. 
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Ex, 1.— Udng the data of Ex, 2, § 97, compute the value of <r by the 
method of mmmtion. 

Ex, 2.— l%e mean daily temperature at Brussels on the 310 days of the 
months of July in ten years was as follows: 

Mean temperature . ll*-5 12‘’-6 13“-5 14**5 16‘’‘6 16“-5 18“-6 

Number of days . . 1 9 21 24 33 32 49 35 

Mean temperature , 19*-6 20"-5 21**5 22*-5 23‘’-5 24‘**5 25®-5 26‘‘-6 

Number of days , .31 24 21 17 7 2 3 1 

Find the mean temperature and standard deviation by the method of 
summation. 


99. Sheppard’s Corrections. —We shall now investigate a correction 
which is to be applied in calculations such as those of the preceding section. 
Let y^f{x) be the equation of a frequency curve. The integral 


f 


xiPf(x)dx^ 


which is called the pth moment of the curve, will be denoted by mp. 

The statistical data which specify frequency curves are often given 
in a summarised form from which the moments cannot be computed 
directly with accuracy. For instance, in statistics of the chest measure¬ 
ments of a group of men, all men whose chest measure lies between 38 J 
and 391 inches might be given under the heading *‘39 inches”; all 
men with measures between 39^ and 40 inches might be given under 
the heading “40 inches,” and so on. The number given under the 
heading “ 40 inches ” is therefore not a true ordinate of the frequency 
curve, but is really the area of that strip of the frequency curve which 
is comprised between the ordinates at a; = 39^ and x = 40^; that is, it is 


r 

•'sol 


f{x)dx. 


Suppose, then, that . . . x_^ a;_p x^ x^, x^ ,., are the values of 
X for which statistical data are given, these values being spaced at 
equal intervals k;, and suppose that the statistical data are the numbers 


. . . Uf^ Wp Wp 

calculate the quantities 


where = 



and 


suppose we 


mp' = 2 XfPug, 

«SS —OO 


( 1 ) 


so that mp' is a rough value for the pth moment, obtained by a process 
which is equivalent to collecting at Xg all the individual m^ures between 
Xf - \w and x^ + \w. 

The problem before us is Jo obtain a formula which will enable us to 
calculate Jhe true moments mp from the rough moments m^, Wc shall 
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suppose that the frequency curve has close contact with the axis of x at 
both ends ; so that 

oo 

xPf(x)dx>=w 2 ( 2 ) 

S=s —00 

Now, by the Newton-Stirling formula (§ 23), 

f(x, + nw) =Jlx,) + {A/(®.) + A/(** - w) > + ^ 1 A */(*» - «) 

+ ’*^”31 { A*fiXi - w) + - 2w)} + A*^'(g« - 2w) + • • .> 

SO we have 

/'* 1 17 

I /(!r» + «w)<*»=/(®.) + 3j^AV(®»-«’)-3j|^AV(®«-2«’) + . . . 

•'“i 



In particular, when f(x) = «* we have 


A^f(xg — nw) = 2®”e** sinh®” ^w. 

Putting 0 = ^w and dividing throughout by e*-, the expansion reduces to 

sinh 6 ^ I . , o /I • 1 Azi 

—B -l + + . 

whence the coefficients in the general expansion may be determined 
I'eadily. 


1 rx,+ivi ri 

Since -1 /(x)dx = I /(«»+ nw)dny 

•'-1 


1 sinh 6^^ 

we may write 

where sinh®"^ stands for the operation We have therefore 

from (1) 



V I « V \ 

^ — XgPUg — XgP —7j /(*#)• 

= _floW 5 =-Q 0 " 


00 » 

Also since 2 XgPf(pCg + qw)^ 2 f(xg)xg_^, 

g~-a3 »=-oo 

the terms of both sides of this equality being the same but counted 
differently, we have 


Therefore 


2 ®,i»(AaE"i)VW= 2 /(a;,)(AaE“i)%,P. 

fs.oo »--oo 


2 f{xs)^^xgP^ 2 /w{i + |5+^,+ip+* • 

w fs^OO ^ «=S -00 V M. V. • 
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fJb 

Now if D denotes -r- we have 

dXj 

E = and A = - 1, 

so J A^E " 1 = sinh^ 

Hence we see that 6 = \wT) and 

w^y^ io®D® 


’ =^?_„ M ^ + 3I2*^M2i + 7T2-• + • • -K 

■= y(*») [kuP + 3^a2’(l’ -1)*.!'' ® 

+ ^P(p-l)(i>-2)(f>-3).T,P-*+. . .J, 


or finally, substituting from equation (2), we have 




«tp = Wip + ^aPOM - l)mp_g + ^iP(P - 1)(P - 2)(p - 3 )«Ij,..4 +... (3) 


Takingp—1, 2, 3, 4, 6 in succession in (3), we have 

= Wjf 

Wg' = Wg + 

wig' = Wg + 

m/ = W 4 + + ^V^o» 

^ 5 ' = ^6 + 

and hence 

Wg = mg', 

TWj = 

^2 = '^2'“ 

Wg = W-g' ~ 

^4 = ^4 “ + a I C^o'» 

mg = mg' - §whng' + 

These formulae, which express the true moments in terms of the 
approximate moments, the curve being supposed to have close contact 
with the axis of x at both ends, are due to W. F. Sheppard.* 

100. On Fitting a Normal Curve to an Incomplete Set of 
Data. —It sometimes happens that we wish to determine a normal 
curve when we know only the ordinates i/j, l/g, . . . of a set of points 
of abscissa asj, ajg, ac^, . . . which are extended over part of the curve, no 
information being available regarding the rest of the curve. In this 
case it is best to treat the problem as one of fitting a parabolic curve 
« a= a + fex + cx® for the given values of z where z = log y. The constants 
a, 6, c may be determined by the method of Least Squares (Chapter IX.). 

101. The Probable Error of the Arithmetic Mean.—Let 


^ Proe. Zond. Maih, Soc. 29 (1898), p. 358, 
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Wq denote the arithmetic mean of n measures Mp Mg, . . M», 

so that 

^o = ^(Mi + M 2 + . . . + Mn). 

Then if denote the modulus of precision of the arithmetic 
mean and h denote the modulus of precision of a single 
observation, from the formula for the precision of a linear 
function (§ 89), we have 



or Aj = hjn. 

The probable error of a single measure being connected 

with the modulus of precision by the equation Q = p where 

p = 0-476936 (§ 94), we see at once that the probable error of the 

arithmetic mean i^ timee the probable error of a single 

observation, 

102. The Probable Error of the Median. —Instead of 
taking the arithmetic mean of the measures of an observed 
quantity as the estimate of the true value of the quantity, 
suppose we now arrange all the n measures in the order of 
their magnitude and select the middle one, which is called the 
median,^ Usually the median will be close to the arithmetic 
mean and will furnish an independent estimate of the observed 
quantity. 

A more precise definition of the median is as follows : 

Let ttp • • -I cbn a set of real numbers, which may or may 

not be all distinct. Let 

= 2 (a; - 
i=l 

The value of x w'hich reduces to a minimum is the arithmetic mean 
of the numbers etj, . . ., a^,. If the condition that 82 ( 0 ;) be a minimum 
is replaced by the condition that 

n 

S^(a;)= 2 \x-ai\ 

Idc reduced to a minimum, the median of the a’s is obtained. It is 

* Cf. § 95, Ex. 2. 

t Dunham Jackson, Bull, Am, Math, Soe, 27 (1921), p. 160.» 
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uniquely defined whenever » is odd ; if the numbers at are arranged in 
order of magnitude, so that 

and if n = 2A; —1, the median is simply ajt, the middle one of the a’s. 
The median is uniquely defined also when n is even, n = 2A;, if it happens 
that being then equal to this common value. Otherwise the 

definition is satisfied by any number x belonging to the interval 

aie^x^aje+i, 

and the median is to this extent indeterminate. But for each value 
ofp>l there is a definite number x=^Xp which minimises the sum 

S,(x)= 

and Xp approaches a definite limit X as p approaches 1. The value of X 
coincides with the median as already defined, in the cases where that 
definition is determinate, and when n = 2k and ajc =4= a]^+ij X is a definite 
number between aie and a^+i. It thus serves to supplement the 
former definition. 


We shall now find the probable error of the median in order 
that we may judge of the relative advantages of the arithmetic 
mean and the median as estimates of the true value of the 
observed quantity. 

Suppose we have n measures (where n is supposed a great 
number). Then the probability that any one measure exceeds 
the true value is ^ and by § 90 the probability that exactly 
(Jw + r) of the measures exceed the true value is 



Now if A is the modulus of precision of the measures, 
the probability that a measure lies at a distance between 0 

or 
Jtt 


and f from the true value where £ is small is j 


approximately ; and therefore of n measures the number 


nh. 


between 0 and f is 


If this number is r, we have r — -ri. 

Of the (i» + r) measures greater than the irue value, |n 
exceed Therefore the probability that the median is at the 
point £ is 
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-dl 


Denoting the value of $ correspondiog to (r +1) by ^ + d$, then 
r + change from r to r +1 correBpondsto 

i/TT 

TkJl t 

an increase of in the number of measures, so ^ = 

The probability that the median lies between ^ and f is 
therefore 

\irn/ Jtt tt 

Thus the probability that the median lies at a distance 
between ^ and $ + d^ from the true value is 

where H = 

^ir 

For this result we see that the modulus of precision for the 

and therefore (§ 94) 

\/ \ n / 

the probable error of the median is 

lV(25> 

where p = 0«476936. 

In the last section we have seen that the probable error of the 


determination of the median is 


arithmetic mean is Thus the error to le feared when we 


take the median as the true value is 




or 1>253 times the 


error to he feared %ohen we take the true value to be the arithmetic 


mean. 

103. Accuracy of the Determinations of the Modulus 
of Precision and Standard Deviation.* — Denote the n 
deviations by . . ., e„, and the modulus of precision by h ; 
on the hypothesis that h has a value H, the a priori prob¬ 
ability of the occurrence of this set of deviations is 


. • +•«*> 


( 1 ) 


* Gauss, Werke, 4, p. 109 ; cf. R. A. Fisher, Month, Kot, M,A.S,fO, p. 758. 
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while on the hypothesis that h has a value (H + A) the a 
priori probability of this set of observations is 

(2jL^g-(H+A)*(ei*+ . . . +€„*) { 2 ) 

By the Principle of Inductive Probability, the ratio of the 
probability that (H + A) is the true value of h to the probability 
that H is the true value of h is equal to the ratio of the expres¬ 
sions (2) and (1); that is, of 

(l + g-(2AH4-Aa)(<i*+... +en*) unity. (3) 


Now let H be the most probable value of h (i-e, the value 
which makes (1) a maximum), so that 

^“\/{2(*,* + . . . + ;„»)}■ 

Then (3) may be written 

(x to umty, 

lonnitjr. 


or 


or 


tiA* tiA^ 

e" ***^’~’ * * to unity. 


or, approximately (A being very small compared with H), 


e to 1. 


Therefore the probability that the value of h lies between 
H + A and H + A + dA is nearly 

wAa 

aoo ^5[Aa 

where K is a constant, which, since / Ke" H®rfA.= 1, is given by 



Therefore the probability that the value of h lies between 
H + A and H + A + rfA is 
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or the modulus of precision for the determination of h hy the 
root-mean-square method is Jn/h. 

From this we deduce at once that the probability that the 
standard deviation o-, as deduced hy the root-mean-square method, 
lies between o- + a? and <r + x + da: is 


<r*/7r 


SO the modulus of precision for the determination of the standard 
deviation o- by the root^mean-square-method is JnjiT. Hence 
the probable error of the standard deviation o*, as deduced by 
the root-mean-square method, is 


0-476936 

- -O'. 

Jn 

Gauss * extended this by showing that the probable error 
of the standard deviation <r, when it has been deduced by 
computing the jpth powers of the errors, is 




04769360*^2 



i 


This gives the following results: 

When <r has been computed from the— 

I. 1st powers of the errors, the probable error = 
II. 2nd ,, ,, ,, ,, 


0-5095841 




0-4769363 

Jn 


III. 3rd 

IV. 4th 
V. 5th 

VI. 6th 


w n 


V ff 


99 99 


It 91 


0-4971987 

0-5507186 

” " > 

0-6355080 

0-7557764 
” ” /n 


It is evident, therefore, that the most advantageous method 
is the root-mean-square. In fact, 100 errors of observation 
• IFcrl-e, 4, p. 109. 
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yield by the computation of the root-mean-square as good a 
value of the standard deviation as 114 treated by L, 109 by 
III., 133 by IV., 178 by V., 261 by VI. There is not much 
difference as regards accuracy between I. and II., and of 
course I. is much more convenient for calculation. 

Lastly, we must consider the accuracy of the determination 
of the quartile by arranging the n errors of observation accord-* 
ing to their absolute magnitude, and taking the middle one as 
Q; or, more generally, arranging the n errors of observation 
according to their absolute magnitude, and then taking the 
error which has errors less than it, and deriving a value 
H for the modulus of precision from the equation 

Let h be the true value of the modulus of precision and let 
X be such that Jix = Uxp. Then x is the position that x^ would 
have if the number of measures were infinite, so that perfect 
accuracy in the determination of Xp could be attained. Writing 

1 we can show that the probability that out of 

n measures (all taken positively) m^ + r lie between 0 and x is 

1 _ rL. 

___ e 

^{2Trnpq) 


Now the probability that a measure (taken positively) lies 
between x and x +.f, where g is small, is 

and therefore of n measures (taken positively) the number 
between x and a; + ^ is 

If this number is r, we have 

so the probability that £Cp = a? + g is 
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and therefore (as in the corresponding discussion of the 

median^ § 102) the modulus of precision for the determination 

. . J(2n)he-^ 

—• Si^ce Ha?p = constant, we have 


and therefore by the formula for the precision of a linear 
function of deviations, we have 
Modulus of precision for the determination of h by this method 

=* ^ tunes the modulus of precision for the determination of 
*/(??”•) 

Hence the square of the standard deviation for the determina- 
tion of h by this method is 




or if t be the value of hx obtained from the equation ^ = 
it is 


^ 2n~ ‘ 


In particular, when we determine the quartile by finding 
the deviation which is in the middle of the series of devia¬ 
tions arranged in absolute order of magnitude, we have 
= q = t = p where p = 0-476936, and therefore the 
standard deviation for the determination of h by this method is 



so the probable error of the determination of the quartile by 
this method is 


p^2 • 



or 



• • Q, 


or, in numbers. 


0-79 

ijn 


Q. 


This result is due to Gauss: * it shows that (on the average 

♦ Loe, dt, > 
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of a large number of determinations) 249 measures, treated 
by this method, must be taken in order to yield as good 
a value of Q as 100 measures treated by the root-mean- 
square method: this determination is nearly the same in 
accuracy as that by the sums of the 6th powers of the errors. 
We can, however, choose much more advantageously than 
this: in fact, determining the minimum of the function 

we find that it has a minimum when ^ = about 1-05, and 
therefore the most accurate determination of the standard 
deviation is obtained when we determine the error x, which is 
such that about 86 per cent of the errors (all taken positively) 
lie below x and about 14 per cent above x : the probable error 
of the standard deviation found from this value of x is then 
only 1-24 times as great as the probable error of the standard 
deviation determined by the root-mean-square method.* By 
taking ^ = 1 we obtain the following easily remembered precept: 
the measure of precision is the reciprocal of that deviation which 
is exceeded {in absolute value) hf 16 per cent of the observed de¬ 
viations and not attained by Slf, per cent of them: for these 
percentages we can put more simply \ and 

104. Determination of Probable Error from Residuals. 
—In § 94 we have regarded a and o- simply as two parameters 
which occur in the problem of fitting a curve of the type 

1 

^ a-J[2Tr) 

to certain data. When, however, we are dealing with errors of 
measurement of an observed quantity, it is necessary to regard 
the problem from a somewhat different point of view. We 
must now take into consideration the fact that the quantity 
measured has a certain true value which, though unknown, must 
be regarded as possessing a physical existence: this true value 
must be distinguished from the arithmetic mean of the 
measures, which is merely the best estimate we can form of it. 
The differences of the measures from the true value of the 
* Ibis was pointed out by F. Hausdorff, Leipzig Ber, 53 (1901), p. 164. 
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quantity are the errors^ while the differences of the measures 
from their arithmetic mean are called the residuals. We shall 
now show that the probability that the error should lie 
between prescribed values a and h is not equal to the prob¬ 
ability that a residual should lie between a and h. 

Let the measures be denoted by M^, Mg, . . their 
arithmetic mean by the true value by m, and let the 
residuals be 

= ^2 = 7 / 10 - Mg, . . . , 

while the errors are 

= m - Mj, €2 = 7/1- Mg, .... 

Adding the last equations we have (denoting the sum of n 
quantities by square brackets) 


[e] = nm - [M] = nvi - nm^. 


Therefore 

H 

7//.o = m-LJ 


and 

[ 5 ] 

t? = m - - M, = €, 

1 71 ^ ^ 

_w 

n ' 


71-1 1 

1 

or 

^ =-€- €0 - . . 

^ n ^ n ^ 

n 


Thus the residual is expressed as a linear function of 
the errors. 

Now let h denote the modulus of precision of the errors, and 
h! the modulus of precision for the residuals: then the formula 
for the precision of a linear function of errors gives 



71-1 


so the probability that a residual lies between v and v + dv\a 




nh^ 

'^-^dv. 


Since 


J^_Mandi = cr3 



we have therefore 
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and the probable error or qoartile of the errors is 
Q = 0-67449(r 

and therefore the standard deviation of the arithmetic mean 
of the n observations, which is is while the 

probable error of the arithmetic mean is 0*67449^ 

These are generally known as BesseVs formulae. 

Similarly, the mean absolute deviation r; is given in terms of 
the absolute values of the residuals by the equation 

so that in terms of the absolute values of the residuals we have 


‘"-Ay/ 


and the probable error of a single observation is 

while the probable error of the arithmetic mean of n observa¬ 
tions is 0*84535[lv|]. 

This formula is due to C. A. F. Peters.* It can be more 
readily computed than Bessers and is in general sufficiently 
accurate. 

105. Effect of Errors of Observation on Frequency 
Gurves. —Let a large number N of individuals be measured as 
regards some attribute, and suppose that the number found 
to have measures between x and x + dxvA TSy(x)dx. Suppose, 
however, that the measures are known to be vitiated to some 
extent by errors of observation, each measure being liable to 
error with a modulus of precision and suppose that, in 
consequence of these errors, the number of individuals having 
a true measure between x and x^-dx not TSy{x)dx but 
^ ♦ A$t. Naeh. 44 (1866), p. 29. 
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'Su{x)dx, It is required to find the function the function 
y(x) and the modulus h being known. 

There are actually individuals having the measure 

between t and In consequence of the errors of measure- 

menti these contribute a number 

Jir 

to the measures between x and x + dx. Therefore the number 
of measures observed between x and x-^dxvi 



mJtT J^tc 



N/t r roo 


or 

-j-cbj 1 6"^^ ^ u[x + s)ds, 


or 

^^dxj + . . 

. jcfe. 

or 

N&:|w(«) + + • • • 

}• 


Thus the functions y(p') and «(«) are connected by the relation 

2/(a:)=«(a;)+^M» + g^M*'(a;) + . . . , 

which readily inverts into 

u{x) = (^') +. . . • 

This equation determines u(x) in terms of y(x). 

Miscellaneous Examples on Chapter VIII 

1. The following frequency distribution was obtained by counting 
the number of letters per line of a book. Calculate the mean value and 
the standard deviation, and indicate what might l>e expected to happen 
as the number of observations is increased. 


Number of letters (n) 32 

33 

34 

35 

36 

37 

38 

39 

Frequency (f) 1 

2 

2 

10 

23 

31 

42 

54 

(n) 40 

41 

42 

43 

44 

45 

46 

47 

(J) 46 

55 

35 

28 

16 

10 

2 

2 


(Edin. Univ. Honours Exam., 1918.) 

2. Compute the mean height and standard deviation from the 
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following data by the method of summation, verifying the result by the 
root-mean>square method. 


^ in inches (h) 

64-5 

66-5 

66-6 

57*5 

68-5 

59-6 

60-5 

61-5 

Frequency (f) 

2 

4 

13 

36 

69 

159 

271 

326 

(h) 62-6 

63-5 

64-5 

65-5 

66-5 

67-5 

68*6 

69-5 

70-5 

if) 366 

326 

229 

157 

82 

32 

15 

9 

4 


3. Calcuhite the j^robability that in a given interval of time there 
will be a given number of “calls” at a telephone exchange, and the 
probability that a subscriber will be kept waiting a given time. 

[Cf. A. K. Erlang, Nyt Tideskrift for Math, 20 (1909), p. 33.] 

4. A vector € is the resultant of a very large number n of elementary 
vectors, each of given (small) length, whose directions are distributed at 
random in all directions in the plane. Show that the probability that 
the resultant vector € should have a length between r and r + dr is 

rdr 

- 2«r* 


where a is independent of f. 

[This result is of importance in the theory of the Brownian motion, 
and also in connection with the scattering of p-rays by matter; tho 
formula is due to Lord Bayleigh.] 

ADDITIONAL REFERENCE 

H. R. Hulme, “ Report on the Statistical Theory of Errors ”, Monthly 
Notices B.A.8 .100 (1940), p. 303. 



CHAPTER IX 

THE METHOD OF LEAST SQUARES 

106. Introduction.--!!! the present chapter we shall be 
concerned with one particular kind of frequency distribution, 
namely, the distribution of the measures of an observed 
quantity, when these measures differ from each other owing to 
accidental errors of observation. 

The deduction of the normal law of frequency given in the 
preceding chapter is applicable to this particular ^stribution; 
but alternative deductions have been given which depend on 
special assumptions regarding errors of observation, and which 
are in the highest degree interesting and worthy of study from 
the point of view of axiomatics. We shall therefore now make 
a fresh start with the theory. 

107. Legendre’s Principle. —In the mathematical dis¬ 
cussion of the results of observation, it is required to derive 
from the data the best or most plausible results which they 
are capable of affording. When the quantities which are 
observed directly are functions of several unknown quantities 
which are to be determined, the problem can generally be 
reduced (as will be seen later) to a formulation such as the 
following: 

It is required to find values for a set of unknown quantities 
x,y,z, . . . in such a way that a set of given equations 

’Oi® + Jjy + «!« +. . .+/i< = Wi, 

a^ + l^ + e^-{-. . .+f^ = n^, 

• ••••• 

+«««+• • 

(called the equations of condition) may le satisfied as nearly as 

(bSU) 209 s 
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possihle^ when the number s of eqtmtiom is greater than the 
number of unknowns x, y,z, . » t, and the equations are not 
strictly compatible with each other. 

By saying that the equations are to be satisfied as nearly as 
possible we mean that the quantities 

E2 = ajja3 + &2?y + C22! + . . 

which we shall call the errors, are to be as small as possible. 
We shall, for the present, assume that the equations are equally 
trustworthy, i.e. that the quantity, which is more precisely 
defined later as weight, is the same for each equation. 

In 1806 Legendre* suggested for the solution of this problem 
a principle which may be thus stated: of all possible sets of 
values of X, y, «, . . ., the most satisfactory is that which renders 
the sum of the squares of the errors a minimum ; that is, 

Ei^ + Ea^H*. . . + E,2 

is to be a minimum. 

For the present we shall simply accept this as a convenient 
working principle which serves the intended purpose: later in 
the chapter (§§ 110,115) we shall examine different attempts 
which have been made to deduce it from other principles which 
have been regarded as more evident or better fitted to serve 
as fundamental axioms. 

108. Deduction of the Normal Equations. —Assuming 
Legendre's principle, we have now to find the values of x, y, 
z, . , , which make 

Ei2 + E2* + . . . + E,® 

a minimum. If we use the notation [ ] as a symbol of 
summation so that, e.g., [^a] = + «/+. . . + a^, lab'llaf)^ 
+ ^ 2^2 + • • • + the sum of squares is 

+ [65]y* + \cc'\z^ +. . . + 2\ab’\xy + 2 [«c]a? 2 ! +. . . - 2[an]:r 

- 2\bn'\y -. . . + \nn\ 

* Nmvelles Mithodes pour la dilerminoUion des orhites dea comktea, Paris, 
1806, p. 72. 
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The unknowns x,y,z, . . . are to be chosen to make this a 
minimum, so that the derivatives with respect to a;, y,. 
must vanish. Thus a minimum value is obtained when the 
unknowns are calculated from the equations 

{ [aay. + \ah'\y + {ac\z + . . . = [rm], 

[rt6].c + [hh]y + \hc]z +. . . = [hn\ 

These are known as the normal equations: from them the 
valuesof . . . are to be determined by ordinary algebraic 

analysis. Evidently, in order to form the normal equation with 

respect to any one of the unknowns^ we must midtiply each of the 
original equations hy the coefficients of the unknown in this 
equation and add together all these products. 

The method which has been followed was named by Legendre 
the Method of Least Squares. 

In deriving the normal equations from the equations of condition we 
can use tables of quarter squares, for we have not only 

\aa\ . . + a,*, 

but also [ah] = | {[(a + &) (« + h)] - [aa] - [66]}. 

As a check on the computation of the normal equations, it 
is well to calculate the sums 

(Tj = 4- +. . . '^'fi 4" n-yi 

<rj, = fl«4-6«4-C^4-. . .4-/«4-71„ 

and from these to calculate the sums [ao-], [6<r], . . ., [fr]. 
These, with the sums [aa] . . . [ff] which have already been 
computed, should satisfy the equations 

( [^a] 4-[afe] 4-. . . 4 -[a/] 4-[a^i] = [rto-], 

[«/] + [¥] + • • • + [#] + [«/] = [/<"]• 

The verification of these equations serves to assure us of the 
accuracy of our calculation of the normal equations. We shall 
discuss later (§§ 117-121) the systematic solution of the normal 
equations and the controls connected therewith. For the 
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present we shall regard the solution merely as a matter of 
elementary algebra. 

Ex, 1 .—Find the most •plausible values of x and •y from the equations 

4*91JB-69-0y= -339-8 
2-72a;- 2-7i/= - 47-5 
0*05a; +32-42/= 262-5 

-2-91ic +27-72/= 152-9 

-4-77®+ 1-42/=- 27-9. 

We shall first find the normal equation for x 
multiplication table. Multiplying each equation by 
in it, we have 

24-lla;-289-7^= - 1668-4 

7- 40a;- 7-3y= - 129-2 

1-62/= 13-1 

8- 47ic- 80-62/=- 444-9 
22-75a:- 6-72/= 133'1- 

Adding, we get 

62-73a;-382-7y= -2096-3. 

This is the normal equation for x. 

We shall now find the normal equation for •y by use of a table of 
squares. AVe have 


by use of Crelle’s 
the coefficient of x 


(1) 


\ad 

\bV 

[a + 6, a + 6 


= 4-912 + 2-722 + 2-912 + 4-772 = 62-73 
= 592 4- 2-72 + 32-42 + 27.72 + 1.42 = 5307*3 
= 54-092 + 32-452 + 24-792 + 3-372 

= 2925-73 +1053 + 614-54 +11-36 
= 4604-63 


[6 + 71, 6 + w] = 281840-10 
[n, 7i] = 210783-36. 

Therefore 


[a6] = i{4604-63-62-73-5307-3} = -382-7 
M = + 71, 6 + ?i] - [66] - Ml = 1(65755-44) = 32877-7, 

and the normal equation of y is 

- 382-7a; + 5307-37/ = 32877-7. (2) 

From (1) and (2) we find 

a; = 7-81, 7/ = 6-76. 


The computation of the normal equations and the check on the 
computation may be earned out simultaneously as in the following 
scheme; the coefficients in the nonual equation for x being read from 
the row (3), and the coefficients in the normal equation for y from the 
row (4). In each table a row is formed for each of the given equations, 
and the columns are added. The sum of the first three columns should 
then be pqual to the sum of the last column. 
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na. 

nb. 

nn. 

n<r. 

-1668*418 

20048*20 

116464*04 

133843-822 

- 129*200 

128*26 

2266*26 

2256*300 

13*126 

8606*00 

68906-26 

77424-376 

- 444*939 

4236*33 

23378-41 

27168-801 

133*083 

- 39*06 

778-41 

872-433 

- 2096*349 

328V7-72 

210783-36 

241664-731 

--—iu— 
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Ex. 2 .—Find the moat plauaihU values of x and y from the equations 

35+ i^=3*01 
2a;- y = 0*03 
x+3y = l-02 
3a; + 2/= 4*97. 

Ex, 3. — The following data were read from a graph showing the probable 
stature of son for given fatheads stature^ denoted by S inches and F inches 
respectively: 

S, 66-7 66-8 67-2 69-3 69*8 70-6 70-9 

F, 62 64 65 69 70 71 72. 

If S and F are connected by the relation S^a + bF^ determine by the 
Method of Least Squares the most probable values of the constants a and b, 

(Edin. Univ. Actuarial Diploma, 1922.) 

109. Reduction of the Equations of Condition to the 
Linear Form. —In many cases the original data of the problem 
are not immediately expressible as a set of linear equations of 
condition. Suppose, for example, we require to find the most 
plausible values of x and y from a set of equations 

where . . .,f, are known functions and Mj, . . m, are 
measures derived from observation and liable to accidental error. 
In this case we first find, in any way, an approximate pair of 
values of x and y, say x, y. Putting a;=» + y -t 17 , we have 

fi{^> y) =/i(*+f y+»?) = y )+ 

ox ay 

approximately. 

The equations of condition thus become 


y) f , yi(^> y) 




9/.(». y)f , y) ,, f.- -A 


and these, being linear in (f, 17 ), can be solved by the process 
already described. 


Ex,—Determine the most plausible values of the rectangular co¬ 
ordinates (a;, y) of a point P, given that its measured distances from the 
points (0, 0) (7, 0) (0, 6) are respectively 6*40, 4*47, 6'38, and that these 
measures are of equal weight. 
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We readily find graphically that approximate values are xs= 6, y=4,. 
Therefore writing ®=sa; + ^, y = y + the equations of condition become 

(6 + ^)® + (4 + = e'40, 

(2-|)*+(4 + >j)*}i = 4-47, 

{(6 + |)a + (2-i})Sll = 6-3S, 


or 


5^ + 4i) 


V41 


-G.40- V41= -0.0031, 


-^,^ = 4.47- V20= -0-0021, 


V 20 

=5.38- V'29“-0.0052. 


I V29 

These equations, which are linear in ^ and are now treated as ordinary 
equations of condition as in § 108. 


110. Gauss’s ''Theoria Motus”; the Postulate of the 
Arithmetic Mean. —We now proceed to consider the various 
attempts that have been made to place the Method of Least 
Squares on a logical foundation. 

The first writer to connect the method with the mathe¬ 
matical theory of probability was Gauss.* His treatment 
assumes as a postulate that when any number of equally good 
direct observations M, M', M", ... of an unknown magnitude 
X are given, the most probable value is their arithmetic mean. 
Gauss’s deduction of the law of error will be given in § 112: 
for the present we shall consider the postulate in itself.f 

This postulate must be distinguished from the statement that as the 
number of observations is increased indefinitely, the arithmetic mean 
tends to the true value of x : this latter statement is indeed correct, % 
and is true of an infinite number of other functions besides the arithmetic 
mean: § but we cannot infer from it that the arithmetic mean gives the 
most •probable result when the number of observations is finite. 

In recent years the postulate of the arithmetic mean has 

* Theoria motus corporum mlestiumy Hamburg (1?09), § 177. Gauss 
mentions that he had used the method from the year 1795. 

t For a critical discussion see P. IMzzetti, “ I fondamenti mat. per la critica 
dei risiiltati sperimentali,” Atti della M. Univ, di Genova per il cerUenario 
ColomMano, 1892, pp. 113-334. 

t Indeed we may define the true value of a physical quantity as the limit to 
which the mean of n observations tends when n increases indefinitely, 

§ E,g. ~ > 0, where / is any odd function, when the number of 

observations is increased indefinitely. 
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. been exhibited as a deduction from other axioms of a more 
elementary nature,* which may be formulated thus: 

Axiom L —The differences between the most probable value 
and the individual measures do not depend on the position of 
the null-point from which they are reckoned. 

Axiom II —The ratio of the most probable value to any 
individual measure does not depend on the unit in terms of 
which measures are reckoned. 

Axiom III —The most probable value is independent of 
the order in which the measurements are made, and so is a 
symmetric function of the measures. 

Axiom IV. —The most probable value, regarded as a function 
of the individual measures, has one-valued and continuous first 
derivatives with respect to them. 

From these four axioms we can derive the postulate of the 
arithmetic mean in the following way: 

Suppose the most probable value is expressed in terms of 
the n measures x^ by the function f{x^^ x^, . . x^). 

Then by the theorem of the mean value in the differential 
calculus (which by Axiom IV. is applicable), we have 

fcj, . ... kxn) =yi;o, 0,. . 0) + +... + ^[|£]> 

where the square brackets denote that every ^ is to be replaced 
by Okx, where 6 lies between 0 and 1. Now by Axiom IT., the 
left-hand side = kf{x^, x^^ . . ., Xn ): and since by the continuity 
of the function /, the equation 

f{kxi, fes,, . . lex^ = kj\xy 


must hold in the limit when Ic is zero, we have 


Thus we have 


/( 0 , 0 , . . 0 ) = 0 . 


or, dividing by k. 


• •• 



• Cf. G. Schiaparelli, Rend. Ist. Lomhardo, (2) 40 (1907), p. 752, and Jst. 
JVocA. 176 (1907), p 206 ; U. Broggi, VEnseignemenA 11 (1909), 

p. 14; R. Schimmack, MaJth. Ann. 68 (1909), p. 125. 
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In this equation make : then each of the quantities J 

tends to a value which is independent of the a:’s, so we can 
write f{x^, Xn)=c^Xj^ + . . . + c„a:„, where the c’s are 

independent of the x’b. By Axiom III., the c’s must all be 
equal, so 

x^y . . av,) = c(^x^ + X 2 +, . . + a^n), 

and since Axiom I. gives the equation 

J\x^ + h, . . ., a?„ +A) =/(»!, . . .,!*«) +A, 
we have enh = A, 

1 

so c = — 

n 

Therefore /(aii, % . . ., a*) = ^(*1 + a-g +. . . + a:„), 

7h 

which expresses the postulate of the arithmetic mean. 

111. Failure of the Postulate of the Arithmetic Mean. 
—For certain types of observations the postulate of the 
arithmetic mean is not valid: in particular, for visual photo¬ 
metric measurements in astronomy.* In these the quantity, 
of which measures are made, is the ratio of the brightnesses 
of the two stars. Suppose that x is the true value of the ratio 
for the two particular stars, and let /g, In be different 
measures of it. The observations being supposed to be made 
visually, we take as our starting-point the Weber-Fechner 
psycho-physical law on the sensitiveness of the human retina 
to differences of light intensity: this asserts that increment of 
sensation is proportional to relative increase of excitation, or 

SE = constant x y, where E measures the sensation of light and 

f is a physical measure of its intensity. If E and Eq denote 
the intensities of perception corresponding to the brightnesses 
/ and X, we have therefore 

E-Eo = clog-. 

ilf 

The quantities E-E, represent the errors of observation; 
denoting them by Aj, . . ., A,„ we have 

Ai = 6log|, Ag^clog^, . . . A„ = clogA 

• Of. Seeliger, Ait. Nach. 1S3 (1893), col. 209. * 


(D8U) 
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The A’s appear to obey the normal law of facility: so that the 
most probable value of is that which makes 

g—. . . +An*) 


a maximum, i.e. which makes 

V + V + . . . + A„*' 
a minimum; this gives 

0A, 3A« . 3A„ f. 

or log^i + log-J + . . . + log^* = 0, 


80 


IJ'Z • • • Ifi _ 1 


or . . . InYK 

This formula for determining the most probable value from the 

observations was first given by Seidel in 1863.* 

Ex.—Show that if the prob ibility that the error of a measurement will 
he between x and x + dx is c{ 1 + [^| ^}”Va, where c and h are constant, 
then the arithmetic mean of two measurements is not as reliable as a single 
measurement, and in fact is the least probable value among all possible 
weighted means. (£. L. Dodd.) 

112. Oauss’8 ‘^Theoria Motus” Proof of the Normal 

Law. —We shall now show how, when the postulate of the 
arithmetic mean is granted, the normal law of error can 
be deduced. 

Suppose that for the measure of an observed quantity, the 
probability of an error between A and A + e^A is </>(A)(fA, so 
that <^(A) is the relative frequency of error. If e denotes the 
least quantity to which the measuring-instrument is sensitive, 
we can suppose that the possible values of any measure proceed 
by steps of amount €, and the probability of an error A may be 
taken as <^(A)£. 

It should be noticed that it is here tacitly assumed that the probability 
of a certain deviation depends only on the magnitude of the deviation. 
If this assumption is not made, a law of facility much more general 
than the normal law may be deduced.'!’ 


* mnchen. Abh. 9 (1868). 

t Gf. Poincar4, Cdleul, des prdb. p. 156, and B. Meidell, Zeils. fUr Math, u, 
Phys. 66>(1908), p. 77. 
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How suppose that a number 8 of measures ... are 

taken of a quantity x whose true value is p. The errors are 
A = M - A' = M' - p, etc. The probability of the error in the 
first measurement being M-p is the probability 

of the error in the second measurement being M'-p is 
<^(M' -p)e, and so on. The probability that a set of measures 
(M, M', M' . . .) will occur is therefore 

€*.<^(M-p)<^(M'-p)^(M*-p) .... 

If now we assume that, before the observations are made, all 
values of x are equally likely to be the true value,* it follows 
from Bayes’ theorem in Inductive Probability that, when the 
observations have been made, the probability of the true value 
of X lying between p and p + dp is 

<ft(M -p)«^(M' -p)^(M'’ -p) 

I </>(M-p)^(M'-p)^(M''-p) ... dp 
•' — 00 

and therefore the most prohaUe liypothesis regarding the true 
value of X is that x has that value which makes 
<l>{M.-x)<f>{M.'-x)<t>{M.''-x) . . . 
a TTiavinui m , i.e. that value of x for which 

2 ^1og^(M-a;) = 0. (1) 

Adopting the postulate of the arithmetic mean, we see 
that equation (1) must be equivalent to 

a!=-(M + M' + . . . ) 
or S(M - x) = 0. 

Therefore ^ log </)(M - it?) = c(M - x), 

where c denotes a constant, and therefore 

where A denotes a constant, so 

<^(A) = 

* It is not necessary to make this an independent assumption, for it may be 
deduced as a consequence of tbe postulate of the arithmetic mean, which will 
be introduced presently. 
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Since the sum of the probabilities of all possible errors is 
unity, we have 

r •f>{A)dA = l 
J _oo 


or 

But 


so 


l = Ar 

J .00 

f e-v*dy=Jv, 

J .00 


Writing A for ^/(Jc), we have 




which shows that the distribution of the measures about the 
true value is a normal frequericy distribution, 

113. Gauss’s **Theoria Motus” Discussion of Direct 
Measurements of a Single Quantity. —Assuming, then, that 
in the measure of an observed quantity the probability of an 
error between A and A + dA is 




where h is the modulus of precision, we note that the modulus 
of precision affords an indication of the weight which must be 
attached to an observation when it is to be combined with 
other observations. Thus in observations to determine the time, 
made with the meridian circle, the modulus of precision is less 
for a star very near the pole than for an equatorial star, so in 
combining the results of an equatorial with a circumpolar 
observation we should attach more importance to one than to 
the other. 

Suppose now that s measurements are made of a quantity a?, 
the measures being a>s, and the corresponding 

meapures of precision being Aj, A^, . . ., A,. Let p be the true 
value of X, so the errors are p - jp - iCg, . . . The probability 
of precisely this set of errors is therefore 
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Now the most probable value of « is that value of p which 
makes this expression a maximum, i.6. it is the value of x 
which makes 


h^(x - +. . . + Aj*(a; - »,)* 

a minimum. It is therefore given by 

Aj*(a: ~a:j) + h^{x - +. . . + h^{x - x,) = 0, 


or 


x = 


hj^os, + h^x„ + . . . + 

V+V+TTTTV" 


( 1 ) 


Now we have seen (§ 89) that the modulus of precision 
H for any linear function + + - errors 

Aj, Ag, . . . whose moduli of precision are A^, . . . is given ■ 
by the equation 




.+ 


O' 


( 2 ) 


Therefore the modulus of precision of the quantitv x given by 
(1) is H when 


1 _ 1 fh* O 

H*“(2A*)nV'^V'^‘ 


O/’ 


so H* = /i.i* + 0 + * • • + 0- (3) 

Now let h be the modulus of precision of certain observa¬ 
tions which are taken as a standard for comparison of precision, 
and write 


w, 




A*’ 






o 

A®' 


W = 


H! 

A*' 


( 4 ) 


The equations (1) and (3) become 

_ WjOJj + WgVg + . . . + w^, 

~ 'Wi + W^ + . . . + w, ’ 

W = W>1 + W2 + . . . + w^ 

These equations are evidently analogous to the equations 
which determine the portion of the centre of gravity of 
particles of weights w^, . , w, placed at the points 
Xi, fl!j, respectively, W denoting the weight of the 

equivalent body to be placed at the centre of gravity. On 
this account Wj, w, are called the weights of the 

observations x^, x^ . . .,x, respectively; the weight W of the 
result is the sum of the weights of the separate observations. 
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Hence we deduce at once the following results: 

1. The mean of 8 equally good observations has a weight s 
times thcU of any one of them. 

2. If w denotes the weight of a determination p which is 
deduced as the most probable value of x from a certain net of 
observations, and if we adjoin a new observation x—p-\-u of 

u . 

wmght 1, then the most probable value of x is p + —^ and it 

has the weight {w+1). 

3. If an observation x has the weight w, then on multiplying 

w 

it by any number X the^ new value Xx has the weight For 


if the probability that x lies between x and + efo? is -r-e-'^^^dx, 
then, denoting kx by y, the probability that y lies between y and 


h h 

y-¥dy is dy, so the modulus of precision for Xx is -r, 

Aijir \ 

w 

and therefore the weight of Xx is It follows that: 

4. Obsei'vations of different weights can be treated as observa¬ 
tions of unit weight by multiplying each equation of condition 
by the square root of its v)eight (§ 114). 

Ex.—If + • • • + ® Imear function of independent 

estimates . . ., Xg of a number x, determine • • •y ag 

subject to the condition a, + 03 + * * * + ^0 that the weight of the 

linear function may be a maximum. 

From equationB (2) and (4) above we see that ^ ~ + ■ 

where W is the weight of the linear function and . . ., Wg are the 

weights of the independent estimates. Therefore we must have 

subject to 0 =s + da^ +. . . + da,, 

and therefore ^ =. . , ~ 

Wg Wj^ + W^-\-. 


1 a,* a.2 a** 

^2 Wg 


Wy W^ 


. + ag 
• +Wg' 


80 




Wj + tCgH-. . .+Wg 

Thus the estimate for x which has the greatest weight is 
w^g^>^tg2^2"^■ ■ • + w^g 

“ irj + W2+. . . + Wg 

This agries with the value of x given by the Method of Least Squares. 
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114. Gauss’s "Theoria Motns” Discussion of Indirect 
Observations.—Now suppose that a set of unknown quantities 
x,y,z,.. . are to be determined from s measures n^, n^..n„ 
the unknowns being connected with the measured quantities 
by the equations of condition 

la^x + \y + e^z-\-. . .+fytr=ny, 

4 • • • • • • 

+ % + + . . .+/^ = n^, 

where the coefficients a,, 6 ^,. . . are supposed to be 

known accurately, while the measures are liable to accidental 
errors of observation. Suppose that for the observed quantity 
rij the measure of precision is \y for the observed quantity 
the measure of precision is Ag* probability of an 

error in is where is the smallest quantity to 

which the measuring-instrument of is sensitive, so that the 
measures may be supposed to proceed by steps of amount 
Then as in § 113, the probability of the concurrence of errors 
Aj^y Ag,. . . is 

and the most probable values of Xy y, j?, . . ^ are those which 

make this expression a maximum, when 

A^ = a^x + \y + c^z + . . •+fit-n^y 

Ag = «g^ + &ay + C 2 ^^l + . . . 4 -/ 2 ^ - Wg, etc. 

The values [x, yy Zy , . ., t) must therefore be determined 
from the condition that 

h^A^ + h^A^ + . . . + hg^A^ 

is to be a minimum. If w^y w^y. . . , iVg are the weights of the 
observations (which are proportional to the squares of the 
moduli of precision), we must therefore have 

+ t^gAg* +. . ,-¥wJA? 

a minimum: that is, in the notation of § 108, 

+ [i^cc] 2 * +. . . + 2 [?t?a 6 ]<ry +. . . 

- 2 [ioa 7 i]a? -. . . + [i 4 ;nn] 
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must be a miniumm, and therefore the unknowus x, y, st, . 
must be determined from the equations 

4-+ [wac]« +. . . = [i/;a7i], 
[wab]x-\-[wbb]y + [wbcyi + . , . = [ 2 ^ 671 ], 

These are the normal equations. 

It is evident that an observation of weight w enters into 
equations exactly as if it were w separate observations each of 
weight unity. The best practical method of accounting for 
weight is, however, to prepare the equations of condition by 
multiplying each equation throughout by the square root ol 
its weight: the resulting equations then have their weights all 
equal, as we have seen (§ 113). 

115. Laplace’s Proof and Oauss’s ‘‘Theoria Combina- 
tionis ” Proof. —The Method of Least Squares was established 
in an entirely different manner by Laplace in 1811 * and by 
Gauss (who to a considerable extent adopted Laplace’s ideas) in 

1821-23.t 

The common principle of these and various modem proofs 
which have been developed from them may be stated thus: 
Suppose that for s linear expressions 

a^ + bgy + c^ + . . . 

we have respectively the independent estimates n^, n^, . . ., n,, 
derived from observation, the number s being supposed greater 
than the number of unknowns x^y.z, ... . 

For the quantity 

Ai(ai« + 5iy + Ci« + . . .) +A2(a2a? + &2y +V + - • •) + • • • 

+ A*(a4a; + 6,y + c^ + . ..)... (A) 

Thitirieanal. deaproh. Livro 11. cliap. iy. (1812), following a memoir of 1811. 
t Thtoria e<miHnaJti(mia ohaercationum vrrorihm minimia ohnoxiae^ Werlee^ 
Band lY. p. 1. A French translation by J. Bertrand was published at Paris 
in 1855. 

Gauss in a letter to Bessel of February 28,1839, admitted that he had changed 
his views regarding the establishment of the Method of Least Squares since the 
publication of his Theoria Motw in 1809, having abandoned the metaphysical ** 
basis on which the Method was founded In that work. 




THE METHOD- OF LEAST SQUAEES 226 
we have therefore the estimate 

+ • • • + 

Suppose the A.'s are chosen so that in the expression (A) the 
resultant coefficient of x is unity and the resultant coefficients 
of y, . . . are all zero, so that 

is an estimate of x. The problem is to lay down such further 
conditions for the A’s as will secure that this is the test posdUe 
estimate for x. i\ ow this can be done if we take as a funda¬ 
mental idea the notion of the weight of an estimate. Let n be 
an estimate, derived from observation, of the value of some 
quantity: then we shall suppose that with this estimate is 
associated a number w which will be called its weight. We 
shall further establish (in various ways according as the method 
of Laplace or of Gauss or of more modern writers is followed) 
that if w^, w^, . . ..Wg are the weights of the estimates 
. • ., Thg for the linear expressions (1), then the weight of the 
estimate for the expression (A) is W, where 


w ^ 


w. 


^ 4 -. 






w' 


Finally, we shall define the best possible estimate for x to be 
that whose weight is greatest. With these presuppositions the 
best estimate for x is 

X ~ AjTIj +. . . + hgUgy 

where the A’s satisfy the equations 


(B) 



.+v,=i' 

AjJj + A 2&2 + • • 

• +—0 

Aj^Cj^ + A2^2 ■ • 

. + V. = 0 

Also we have 

• • J 

” Ajd^Aj A2rfA2 

1 1 • • 

II 

i 

+ 


Wg 

These equations give at once 



(C) 
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or 


Aj++. 


:■ • -n. 

h. . . = oj 


A, + itw/ig + ij!iOgh^ + +. 

Substituting for the A’s from (D) in (B) and (C), we have 
0 = + + + + . . . 

0 = 1 ++/x'"[wa] +. . . 

0= /^i[i^a6] +/[i^;66] ++. . . 

0= i^wac\ +ii\whc\-\-ix[mc] +. . . 

etc. 

Eliminating the /a*s from these equations, we have 


(D) 


0 = 


\wan\ 

\wad\ 


\wb7i\ 

\wba\ 


0 [waf] \whf] 


[wfn\ 

[wfa] 

• 

WA 


or 


\%van\ 

\iol)ri\ . 

• [wfn\ 

[im6] 

\whl\ . 

. [wfh] 

\wac\ 

\%iJbc\ . 

• [w/c] 

• « 

{waf\ 

[whf] . 

• WA 

\wacC\ 

[;ivhci\ . , 

. [wfa] 

[w6] 

[labb] . , 

. . [wfb] 

\wac\ 

\ 

[wbc] . , 

, . [wfe] 

• • 

\waf] 

[whf\ . , 

• • WA 


As this is the value of x obtained from the ordinary normal 
equations of the Method of Least Squares, we see that the 
present investigation leads to the establishment of that 
Method. 

116. The Weight of a Linear Function. —It remains to 
show how the equation 


+-^ + . 

10 ^ 


• + 




(E) 


W 10^ Wg 

has been obtained by the different writers who have furnished 
proofs of this type. Laplace obtained it by investigating the 
value of a linear sum of errors 

. + . . . + AgCf 




227 


THE METHOD OF LEAST SQUARES 


when each of the errors has a definite law of facility, say the 
probability that the Hh error lies between €, and €r+de, is 
il>(er)der. This investigation we have given in a form some¬ 
what different to Laplace’s, in § 86: as is there shown, it leads 
to the result that if 


f si?<l>{x)dx 

•'O 


is denoted hj A®, then the probability that the linear sum 
+. . . + Xffig lies between -1 and +1 is, when s is large and 
under certain conditions, 




•'0 


4SX*/;* 


du, 


and whatever I may be, this is a maximum when 
Ai%i^ + • + 

is a minimum. Defining the weights as inversely proportional 
to . •, A/, we obtain the equation (E). It will be 
noticed that Laplace’s method requires that the number of 
observations should be very large, or else that the elementary 
errors should follow the normal law of facility. 

Gauss, on the other hand, writing 

+ • • • ” 

' . • • . • 

€s=^a^x + h^/ + . . .-ns, 

obtained accurately 

(C ~ (AjWj ’4“ • • • "1" AgTlg) "i" (^1^1 ks^s^, 

where as before the A’s are multipliers which satisfy the 
equations 

Ai^Kj +. • • + ~ 1 > 

- Aj&j +. . . + = 0, 

etc., 

and therefore reduce the coefficient of sc to unity and the 
coefficients of y, 21, ... to zero. He then assumed that the 
importance of the error AjCj +. . . + A^«, i.«. the detriment of 
which it is the cause, may be represented by +. . . + AaC,)®. 
That is, (Aj^i +. . . + A«e«)® is a function whose mean value is 
to be made a minimum. Thus instead of securing* that the 
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probability of a zero error is to be a maximum, ae he did in 
the Theoria Motus proof, Oauss now endeavoured to diminish for 
each unknown quantity, the probable value of the square of the 
error committed. Now 

+ • • • + + 2SA2A2€2^2* 

The mean value of is 

y — oo 

and the mean value of €^€2 is zero. 

Hence the quantity to be made a minimum is 

+ - • + 

as in Laplace’s proof: and it appears from Gauss’s proof that 
this formula can be obtained by making minimum the mean 
square of error (which is the average of the true square of error 
for an infinite number of cases); or in other words the Method 
of Least Squares gives a result such that, if the whole system of 
observations were repeated an infinite number of times, the 
average value of the square of the error would be a minimum. 
The postulate of the arithmetic mean, on which the Theoria 
Motus proof was based, is not needed here. Gauss himself 
decidedly preferred this proof to his earlier treatment.* 

Some modem writers have derived the equation (E) directly 
from assumptions regarding weight, which is taken as a funda¬ 
mental notion, and have thereby succeeded in establishiug the 
Method of Least Squares without any appeal to the ordinary 
theories of probability of error. This may be done in the 
following way; f 

Let 71 be an estimated value of a quantity x\ then we shall 
associate with this estimate a number w which will be called 
its weight, and we shall assume as an axiom that the weight of 
the estimate \n of the value of \x (where X is any number) is 
of the form wj\\), where /(X) is some function of X. If /i is any 
number, the weight of the estimate gXn of the value of is 

* Gf. letter of Gauss to Sohumaoher, November 25, 1844, in Gauss, Werke, 
9, p. 147. 

t OL Bernstein and Baer, Math, Arm, 76 (1915), p. 284. 



THE METHOD OF LEAST SQUAEES 229 


therefore wf{X)j\ii) : but it is also /(X/a) ; so the function / must 
satisfy the equation 

whence we have /(A) = X* 

where k is some number. Thus if the weight of the estimate 
n for X he w, the weight of the estimate Xn for Xx is 

wX^. ( 1 ) 

Next suppose an estimate I for x has the weight p and an 
independent estimate m for g has the weight q, and let the 
weight of the estimate I tor x + y he r. 

We shall assume as an ao'iom that r is given by an equation 
of the form 

^(r) = f (j9) + (2) 


this being in fact the definition of independence of the two 
estimates. Since the estimate Xl for Xj* has the weight p>f 
etc., we have 

\l/{rX^) = ^(2>X^) + 4'[qX^). (3) 

We have therefore to find the nature of the function \j/ for 
which equation (3) is a consequence of equation (2), whatever 
X may be. Regarding r as a function of p and we have by 
differentiating (2) with respect to p 




and by differentiating (3) 


.dr 


Thus 


^'{pX^) _ \//{rX^) 
and similarly the latter fraction is equal to 

f (?) * 

Therefore “ independent of p, and therefore since 

pk* is symmetrical with respect to p and X* we must have 

mm’ 
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a constant independent of p and X*. The function \[/ therefore 
satisfies the functional equation 

which shows that \l/{x) is a mere power of x multiplied by a 
constant^ and therefore \p{x) is also a power of x multiplied by 
a constant. This constant has no influence on equation (2), 
and we can therefore write without loss of generality 

if{x)—x\ (4) 

Next let % and be two independent estimates of x, each 
of weight 1. Then by (1), and are two independent 

estimates of \ x, each of weight ^: and therefore by (2) and 

(4), + \n^ is an estimate of x of weight r, where 



We shall now assume as an axiom that if two independent 
estimates of the same quantity are each of unit weighty their 
arithmetic mean is an estimate of the same quantity having 
the weight 2. We have therefore r = 2, and consequently 



or (* + 1)7 = 1. (6) 

Combining (1), (2), (4), (5) we see that if n^ is an estimate 
of with weight and if n^ is an independent estimate of 
x^ with weight then + *2^2 estimate of \^x^ + 
with the weight r, where 

( 6 ) 

Now suppose that n^ n^, . . 7?, are estimates of • • •, Xg 

respectively, each of weight 1. Then by repeated applications 
of (6), we see that + • + estimate of 

x^ + X 2 + . . .^Xg with the weight r where 

r^ = l^ + P + . . . + P = a, 

1 

so r = sy. 

Now we shall assume as an axiom that when the weights of 
Wj, . • ., n«are each unity, the weight of 711+712+* • • + ^« is 
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of the order p i.e. the product of this weight and s is always 
finite and bounded as 8->oo. Therefore 

i=-lory=-l. (Y) 

By (6) repeated and (7), we see that if Wg, rig are 
estimates of Xgrespectively.of v)eig}itsw^yW^, . . 

then AjTIj + +. . . + kgUg is an estimMe ofk^x^ + +. .. + k^g 

with the weight W, where 

1 A 2 \ 2 A 2 

4 = ^+:^+. . 

W w^ Wg 

This is equation (E), from which, as we have seen, the 
Method of Least Squares may be derived. 

117. Solution of the Normal Equations.—We shall now 
discuss the solution of the normal equations, which are equal 
in number to the number m of unknowns, and which we shall 
write 

+rti2.y +. . . + =Cp 

a^x ^a^f +. . . + a2m^ ==C2, 

«••••• 

where 

Let D denote the determinant ||ay^||, and let Apg denote the co¬ 
factor of apq in D. Then the solution of the above equations 
is known to be 

Lc^ = AjjCj + A2|^^2 "J" • . • 

Dy = A,2Ci +A22C2 + . . . +Ajn2p,n> 

• •••••• 

= A-yifix + A2m^2 + • • • + Amnfim* 

The problem is therefore to calculate D and its minors Apq. 
To effect this, by the repeated application of the theorem of 
§ 38, we reduce I) to a determinant of lower order: the process 
may conveniently be stopped when the reduced determinant is 
of the 4th order, so that we have, say, 


II 

ft 






^»i2ni 


^wigns 
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where M denotes an external factor, and each element Ipq has 
been derived from an original element apq by a succession of 
processes of the kind described in § 38. 

Now if Ipq is one of these 16 surviving elements, the 
minor Apq of D may be reduced, by precisely the same trans¬ 
formations as D, to the product of M and a determinant of the 
3rd order: indeed this reduced form of kpq may be derived 
from the above reduced form of D by merely forming the co¬ 
factor of hpq in it. Thus a single reduction-process furnishes 
not only D, but also 16 of the minors A^^. 

Taking for example the case where we have 6 unknowns, so that D 

is of the 6th order, we may reduce D by taking and in succession 

as the pivotal elements, and shall thereby obtain the minors 

A A* A* A A^ A^ A A-^ A 

38 -^34 ^36 -^30 ‘»-44 ■'^45 -^55 -^56 

those with an asterisk being obtained twic«. 

We can next reduce D independently by taking and in 

succession as the pivotal elements, and shall thereby obtain the minors 

■^11^12 “^15 ^22 -^23 -^25 -^41 ^42 -^43 ^45 -^61 “^62 “^63 “^65 * 


so that altogether of the minoi's 


All -^12 -^13 -^14 -^15 -^16 
•^22 “^23 *^24 “^25 ^26 
-^33 ^34 -^35 -^36 
-^41 ^46 ^46 
-^66 “^66 


we obtain fourteen of them once, three of them twice, and four three 
times. These multiple determinations serve as checks to the calculations. 
Ex. 1 .—To solve the eqmtions 



x + 3y — 2z 

-2v — 

0-5, 

3a! + 4 ^ - !}Z + 

1^-317 = 

5.4, 

- 2a: - 61/ +■ 3» - 

2ii + 2v = 

- .5-0, 


y-2z + 6w + 3v = 

7-6, 

- 2a; - 3y + 2«j + 3w + 4v = 
we hi'st form the determinant 

3-3, 


1 3 

-2 0 

-2 


3 4 

-5 1 

-3 


-2 -6 

3 -2 

2 


0 1 

-2 5 

3 


-2 -3 

2 3 

4 


and taking an as the pivotiil element, we have at once 


D«D' = 


-5 

1 

1 


1 

-1 


1 

-2 

5 


3 

-2 

3 

0 


-26. 
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We now find the oo^factors of the elements a^q of D, corresponding 
to the surviving elements in D'. For example the co-factor of 
- 6 in D, is evidently (§ 38) the co-factor of 6 ^= 1 in D', so 

Ajj3= -1 -2 -2=3. 

15 3 

3 3 0 

In this way we find the values 

^22- 13 ^32= 3 A42=-11 A52= 16 

^23== 3 ^33= 18 ^43= 9 Ag3=: — 4 

^24“ ~ 11 ^34= 9 ^44= 17 ^54= —27 

^26= 13 A35=- 4 A4b=-27 A56= 37. 

Now selecting a 24 as pivotal element, we form the determinant 



1 

3 

-2 

-2 


4 

3 

-7 

-4 


- 15 

- 19 

23 

18 


-11 

- 15 

17 

13 


and then determine the co-factors of the elements of D, corresponding 
to Cpq of For example, 

Ai 4 = 3 -7 -4 1 =52. 

- 19 23 18 

-15 17 13 

In this way we find the values 

*^21 ~ “* 1> A 31 — 19, A 44 = — 3, A32^=18, 

An= 52, A 22 = — 1, A23= 19, Aj5=18. 

Now taking as pivotal element in D we obtain the remaining 
co-factors : 

A 2 i = “ 1> A 24 = — 3. 

The solution of the above equations can be written down at once: 


Dfl! = 0*5A|i "f" 5*4Aoi — fi’OAa* -f* T'fiA^ + 3*3Ag| 

= 26 0-5.4-95-0-22-669-4 
or — 250;= - 37*5 ; whence a;= 1*5. 

T)y = 0*5A22 + fi*4A22 “ fi’^A 32 -J- 7*5A42 + 3'3A32 
= = 0-5 + 70-2 - 15-0 - 82-5 + 52*8 
or - 26y = 25*0 ; whence y = - 1*0. 

Da = 0-5 Ai 3 -h 5-4A23 - 5-0A33 + 7-5A^ + S-SAg^ 

= 9-5 + 16-2 - 90-0 + 67-5 - 13-2 
OP -26a=-10'0; whence a=0*4. 

Da = 0*5A,4 + 5-4A24 - 6 OA34 -I- 7 5A^ + 3-.3A34 
- 1-5 - 59-4 - 45*0 + 127*5 - 89*1 
or — 25 m = — 67*5 ; whence u = 2*7. 

Dv = 0*6A43 -f* 5*4A2g — 5*0A357*5A45-f- 3*3A4|^ 

= 9*0 + 86*4 + 20*0 - 202*5 + 122*1 
or - 25v = 35*0; whence v = - 1*4. 
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Ex, 2 .—Solve the equatume 

&x-‘Zy+*Iz+ + 2t; s= 356*8, 

- 3a; + 2y 4-2 m + 2t; = 60*6, 

7x + 2z + u— V = 167*0, 

a; + 2y+ «+ m— 6v=- 7*6, 

2a; + 2y- a—6 m+ t;= - 71*2. 

118. Final Oontrol of the Calculations.—^When the most 

plausible values have been foimd by solution of 

the normal equations, we can calculate the residuals Vy, . . v, 
defined by the equations 

• • • • • • !•. (1) 
a»»o+%o + ' • 

These residuals v are (as will appear later) required in order 
to determine the mean error of our results. Meanwhile we 
shall show how they may be used to furnish a check on the 
working hitherto. 

We have 

=* + [66]yo* +. .. + ^[cbb\x^^ +. . . - 2[an‘\x^ -.. . + [nn\ 

« xj^[aa]x^ +... + [af]t^ - [an]} + y^{[ah]x^ +... + [hf]t^ - [J/t]} 
+ . . + 

and by virtue of the normal equations this expression reduces 
to the last set of terms, so we have 

• -“OK + M- 

This equation may be used as a control for checking the 
accuracy of the whole set of computations, [v^] being computed 
directly by squaring and adding the residuals. We have 
previously (§§ 108, 117) described controls on the formation 
of the normal equations and on the calculation of the deter¬ 
minants involved in their solution. 

119. Ganss’s Method of Solution of the Nonnal Equations.— 

The method of solving normal equations given by Qauss * differs, in form 
at any rate, from the determinantal method described in § 117. It 
may described thus: 

From the normal equation in x, we find x in terms of the other 
variables and substitute this value in the remaining normal equations: 

* Theoria Oombifiationis, Supplementum, The method is fiilly described by 
Enoke, Berlin, ustrimomieGhe Jahrhueh (1835), pp. 267, 272, and (1886), p. 263. 
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this gives us the “first transformed system,’' which involves only 
y, and is (like the original system) axisymmetric. 

From the first equation of the first transformed system (viz. that 
equation which was derived from the normal equation in y) we find 
y in terms of z, . . and substitute this value in the remaining 
equations of the first transformed system: this gives us the “ second 
transformed system," which involves only z, . . ., and is also axi- 
symmetric. Proceeding in this way we obtain at last an equation which 
involves only t and from which t can therefore be determined: we then 
determine all the other unknowns, each from the equation which was 
used for its elimination. 

Thus if the original normal equations are 


ax-)rhy-\rgz = l^ 
kx + by +fz == Tti, 
gx +/y + cz = n, 

the set of equations which are used in Gauss's method for the final 
determination of y, z are 


ax+ hy + gz = I 


a h 
h b 

y+ 

a g 

h f 

z = 

a 1 
h m 

\a - 

h g 

{ la h 

■ 


h b f 
9 f « 


h b m 
9 f n 


(1) 


Now if T denote the quadratic form which represents the sum of the 
squares of the errors, namely 

T s ax^ + by^ + cz^ 4- 2fyz + 2gzx + 2hxy - 2lx — 2my - 2nz + w, 
it is known that T can be expressed as a sum of squares in the form 


T=^iax + hy + gz-l^+ p/ 
a 


a h 

a h g 

h b 

h h f 


g f e 


a h 
h b 
h g 

b f 
f « 


i(|; 


y+ 


ir 


a h g l\ 
h b f m\ 
0 f c 
I m n u\ 


which shows at once that the equations (1) must necessarily represent 
the conditions that T is to be a minimum, and shows also that Gauss's 
method of solution is substantially equivalent to the reduction of a 
quadratic form to a sum of squares. 

Ex. 1 .—To solve the equations 

a! + 3y—2z— 2w= 0*6, (1) 

3a! + 4y —6z4- w —3i;= 6*4, (2) 

— 2a; ~ 6y + 3z — 2m + 2v= — 5*0, (3) 

y-2z + 6w + 3vsas 7*5, (4) 

-2a;-3y+2z + 3M + 4v= 3-3. ^ (6) 
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Eliminating x from equations (1) and (2), we have 


1 3 

1-2 


1 0 


1-2 


1 0*6 

3 4 » + 

3-6 

» + 

3 1 

M + 

3-3 


3 6*4 


- 5y + z + u + 3v = 3-9, (6) 

and similarly, by combining (1) with (3), (4), (6) in succession, we obtain 
the equations 

y- a — 2 it — 2 t? = — 4*0, (7) 

y — 2a + 6 u + 3i; = 7*5, ( 8 ) 

3y-2z + 3u = 4*3. (9) 


‘ Now eliminating y from (6) and (7), we have 


-5 1 

-5 1 

,1-6 3 


-5 3*9 

1-1 *+ 

1-2 

“+ 1-2 

v = 

1-4*0 


or 4» +9w +7v= 16*1, (10) 

and similarly, combining (6) with (8) and (9) in succession, we obtain 

9a~26w-18i;== -41*4, (11) 

7a-18 m- 9v=-33*2. (12) 

Continuing this process of elimination, the following equations are 
obtained: 

186m+ 135v= 310*6, (13) 

136m + 86v = 246*5, (14) 

and finally, from (14), (13), (10), (6), (1), we determine the results 
- 1*4, Ms*2*7, a = 0*4, — 1*0, and x— 1*5. 

Ex. 2.— Solve by Gauet^e method the equations 

5sc-3y+7a+ m + 2v= 366*8, 

-3a; + 2y+ 2 m + 2v= 60*6, 

7a! + 2a+ M— v= 167*0, 

a;+2i/+ a+ m-6v=- 7*5, 

2a; + 2y- a — 5m+ v= - 71*2. 

120. The “ Method of Equal Coefficients ” for the Solution 
of Idnear Equations.* —In a method of performing the 
elimination, which is known as the Method of Equal Coefficients, 
the two iSrst equations are reduced to two equations between 
each of the two first unknowns and the remaining unknowns; 
then these two with the third equation of the original system 
are reduced to three equations between each of the three first 

* & I. Clasen, Bnm. 8oc. Se. M (1888), A 60-69, 6261-281. 
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unknownB and the remaining unknowns, and so on. The 
advantage of this method is that the same coefficients occur 
repeatedly, the elimination being thereby greatly facilitated. 
The method involves certain divisions, which, however, shovld 
always give exact quotients without remainders, thereby con¬ 
stituting a control of the accuracy of the computation. 

Thus let the equations be 

+ e^ + d^i + e^ = f^, 1 ^ 

a^ + + djjM + e^v =/g, 1^ 

a^x + + 633 + d^u + e^v =f^, I3 

a^v + hjj + c^sJtrd^u + e^v=f^, 1^ 

a^e + 65 ?/ + c^z + d^u + e^v 1 ^ 

Eliminating x between and l^, we have 

I«A1 y +1 “i'’21 *+1 I “+1 1 ®=I *1/2 1 . 2j 

and eliminating y between and 23 by forming the sum of 
-1 <1^63113 and 6^23 and dividing by a^, we have 

-1 «i^2 |a= + 1 V2 1 « + 1 M2I« +1V2 1 ^ = I hfzV 2i 

Now multiplying 2 ^ by <*3,23 by - I3 by | \ and adding, 

we have 

I «A '’3 I ^ + ! '4M3 I “ + 1 "A '’3 I ® = 1 "A/s I- 83 

Eliminating z between 23 and 83 by forming the combination 
I "1 Vs 1 2a - 1 "i«2 1 83. and dividing by -1 aj)^ |, we have 

-1 "A^sl y +1 I “ +1 "1V3 1 ^ = 1 "1C2/81. 8j 

and eliminating z between 2 ^ and 83 by forming the combination 
I «A®3 1 2i - 1 V2 1 83 and dividing by -1 a^b^ |, we have 

I *1 Vs I 1 Vs^s I ^ I Va^s I ® ~ i Va.^ !• *1 

Now form the combination - a 43 i + 6433 - 0483 +1 aA®3 1 V 
We get 

I "1 V 3^4 I « + I « AV 4 I ® = I « A<^ 3/4 I- ^4 

Eliminating « between this equation and 83 by forming 
I ai&3Ca^4 1 83 -1 aA<^ I *4 and dividing by - 1 a^^^ \, we have 

-1 a4&3<!3(f4 \z + \ a^id^^ I ® = I I* 


4s 
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and similarly 

I ^3^41 ?/ + I 1^ = 1 «1^2^3/4 l» ^ 

— I ^i^2^3^4 I ^ I ^1®2^3^41 “ I ^1^2^3«^ I* 

Now form the combination - 6^42 + 

We get 

\a^\c^d^e,\v =\aj>^c^dj^\. 65 

Eliminating v between 65 and 44 by forming 1 cti\c^d^e^ 1 44 
-1 I ®6 dividing by | |, we have 

— I (l^^ 2 ^d^ej^ I ^ I %^2^3^4^ 1 > 84 

and similarly 

I ^i^ 2^3^4^6 I ^ “ i ^1^2^3^4.^ I> ®3 

— I ^i^2^3^4<^5 I y == I ^V2^3^4./5 ®2 

1 ^>^i^ 2^3^^4^6 I ‘^' “ I l‘ ®1 

The solution is thus completed. 

Ex. 1 .—To solve the equations 

x+3y-2z 0*6, 

ex + 4cy-6z+ w~3f= 5*4, I 2 

— 2a; — + 3» — 2i^ + 2v = — 5*0, Ig 

y — 2z + 6u + 3v= 7*5, I 4 

- 2a;—3y + 22 +3ia +4v= 3'3. I 5 

Eliminating x from equations 1^ and Ig, we have 

- 5?/ + 2 + u + 3i; = 3-9, 2 ^ 

and eliminating y between and 22 by forming the sum of 6 . 1 ^ and 
3 . 22 ) obtain 

5a;-72f + 3 t 4 -v= 14*2, 2^ 

in which the coefficient of x is minus the coefficient of y in 22 - 

Now multiply 2^ by - 2 , 22 by 5, I 3 by - 5, and add to form the 
equation 

4» + 9a+7v=16*l, 83 

in which there are no terms in x and y. 

Eliminating z between 83 and 83 by forming the combination 
•^( 4.82 - 83 ), we obtain the equation 

— 4 i/--M + v®* — 0*1, 82 

in which the coefficient of y is minus the coefficient of z in 83 . 

Now eliminating z between 8 ^ and 83 by forming the combination 
^ 4 . 82 + 7 . 3 ^, we find 

i 4a;+15u + 9v = 33’9, Sj 
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in which equation the coefficient of a; differs only in sign from 
coefficient of y in 3^ Now form the sum + 2.83 + 4 .I 4 , 

37w + 27v=62-l, 

and eliminate u between this equation and 83 , 

— 37« — — 9*2. 

The remaining equations of the process are 

372/-lev = -14*6, 

-37a;-f 18v= -80*7, 

-25v=: 35*0, 

25a= 67*5, 

- 2521 = 10 * 0 , 

26y= -25*0, 

-25a;= -37-5. 

The required solution is therefore 

a!=l« 6 , i/=-l' 0 , « = 0*4, t 4 = 2'7, v=—1*4. 

Ex, 2,^-Solve by the above method the equations 


5a:— 

3// + 


u 

+ 

2v = 

356-8, 

3x + 

22/4- 


2u 

+ 

2v = 

60-5, 

7a: + 


22 + 

u 

- 

v = 

167-0, 

« + 

2y + 

« + 

u 

- 

5v= - 

- 7-5, 

2x + 

2y- 

2 - 

5u 

+ 

- 

- 71-2. 


121. Comparison of the Three Methods of solving Normal 
Equations. —Comparing the three methods which have been 
given—the Determinantal method (§ 117), Gauss’s method 
(§ 119), and the method of Equal Coefficients (§ 120)—we may 
say that the Determinantal method is on the whole the best for 
the solution of a set of normal equations. It should be observed, 
however, that the superiority of the Determinantal method 
depends on the circumstance that a set of normal equations 
is always axisymmetric {i.e. the coefficient of y in the normal 
equation for x is equal to the coefficient of x in the normal 
equation for y). For a set of linear equations which is not 
axififymmetric, the Determinantal method is inferior to the 
method of Gauss. 

122 . The Weight of the Unknowns. —^We sb^ll now 


the 


4. 


^8 


*2 

®6 

84 

5s 

5, 

5i 
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"m 

investigate the weights of the determinations of the unknowns 
t. Denote the most probable values of the unknowns 
hy tff We have in the notation of § 117 

DiTq = A^C^ + A 2]^62 • "t* An^lC^, 

where D and the minor Apj depend only on the accurately- 
known coefficients a, ... of the equations of condition and 
where 

Cj = + WgagTij + • • • + 

Cj = + wj)^n^ +. . . + wfi^n, 

so = +^2^2712+ • • • + (1) 

where 

+. . • = <*1 ’ fi > 

^^'21 ^22 ^23 • • • ^'2w 
^31 ^32 ^*^33 • • • 

^ml ^m2 ^wi3 • • • ^mm 

^2 ~ ^'2 ^2 ^2 • • • > © tC . 

^21 ^22 ^^'23 • • • ^ 2m 

• • • • 

^wi2 ^w3 • • • ^mm 

By giving to Wj, . . ., n, the particular values a ^,. . ., a„ in 
which case we should evidently have * 2 = 1 , we derive from 
equations (1) the result 

D = Km], (2) 

and similarly by giving to n^, the values , . ., l„ in 

which case * 2 = 0, we have 

0 = [« (3) 

Let uox be the weight of the determination of * 2 - Then since 
96^ is given by equation (1), we have by the formula for the 
weight of a linear function of n^, . . 
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D® 

—+ + . . + 

w*^A • • • «’A/* 1 +• • • 

^21 ^^^22 • • • ^21 ^22 * * * ^ 2 m 

••••••• •• 

^ml ^j«2 • • • ^wl <^m2 • • • 

+ 

^'21 ^22 ■ * * ^2»n 
^ml ®m 2 • • • ^wim 



[wafl [w&l] . . . \;wf^] 

^21 ^22 • • • ^ 2 m 

• • • • • 

^ml ^»w2 • • • ^mm 




These formulae give the weights of the determination of 
a?, y, . . .; they are due to Gauss.* 

It is a consideration in favour of the determinantal method 
of solution of the normal equations that as D, Agg, • • • 
are calculated in order to find x, y, the method furnishes 
the weights without any fresh calculation. 

Now let € denote the mean error to be feared in an observa¬ 
tion of unit weight, and let denote the mean error to be 

g 2 

feared in the determination of x: then since = we have 

^ * Theoria Comhinationis, § 21 . 

<d 8 U> 



242 THE CALCULUS OF OBSEEVATIONS 

We shall see later (§ 124) how e may be found in terms of 
the residvials of the equations of condition. 

123. Weight of any Linear Function of Unknowns. — 
Now let 

u = l.^ + l^ + . . . + 

be any linear function of the unknowns x, y,z,..t, so that its 
most probable value is 

where {x^ yo> • • •> ^o) probable values of the un¬ 

knowns. Let it be required to find the weight of the equation 

It will be sufficient in the first place to take M = ia; + l'y. 
Writing as before 

I ^2 = Ajifflj + AjiJg +. . . + Aoti.^, 

and writing 

=Ajg^g + Agg^^ +. . . + etc., 

we have + ^gWgWg +. . . + 

Uyo = %«’i7ig + ij2W2mg + . . . + 

and therefore 

Dtto = (^^1 + + (^^2 + ^'■ n 2)'^%^2 + • • • + (^^* + 

Therefore if denote the weight of the equation u = % we 
have 

~ = {l$j. + + (^^2 + ^’v^^2 + • • • + (^^« + ^'v»)^e 

Now we have already proved that 

^1^1 +$2*^2 + - • . + ^.®m’» = DAu, 
and in the same way we may show that 

^li/jWi + ^gi;gWg + . . . + ^,»;,W, = DAgg. 

T)* 

Therefore - =TOA„ + 2in)Ai8 + TOAgg 

or ^ = AggP+2Agg;i' + Aggi'*, 

( * Gauss, Theoria (hmbiniUionia^ § 29. 
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and similarly in general if is the weight of the function 

+ + . we have 

+ Ajj/j® + . . • + AB|)ni)n®+2 Aj2/j^2 + . • • 

Wu 

This formula gives the weight of the determination of any 
linear function of the unknowns. 

124. The Mean Error of a Determination whose Weight 
is Unity.*—We shall now find the mean error of a determina¬ 
tion whose weight is unity. For simplicity we shall suppose 
that the equations of condition have been multiplied by the 
square roots of their weights, so that they may be taken to be 
each of unit weight. Let them be as usual 

.. (1) 

Write 

+ 6,.?/ +. . .4- ft - = Er; so Ep Eg, . . ., Eg are the true 

errors of n^, . . n,, if x, . . *^1 are supposed to be the true 

values of the m unknowns. 

Write also 

• "J" fr^Q > 

SO Vg are the residuals when the most probable values 

sCq, . . ,, tg are substituted in the equations of condition. 

We have evidently 

M = 0, [H = 0, . . . [>] = 0. (2) 

Also 

E,-®i = rti(a:-ir„) + 6i(y-yo)+. . .+/i(<-<o)l 

;•.[• ( 3 ) 

E,-®,=a,(£B-a!o) + &,(y-yo) + - • 

Multiplying (3) by v ^,. . and adding, remembering (2)^ 
we have 

[E®] - [w] = 0. 

Multiplying (3) by , E, and adding, we have 

[EE] - [Ei>] = [aE] (a; - a\,) +. . • + f/E] (< - <o)> 

80 [EE] - [w] = [«E] (aj - a!o) + • • • + [/E] (# - Q. (4) 


* Gauss, Theoria CombiruitioniSt §§ 37, 38, 39. 
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Now we have already seen that if coefficients aro 

defined by the equation 

then [a€] = D. [6^] = 0, . . .. C/?] = 0, 

and 

[ef] = +. . . + L/?]<o - M = Da;„ - [«^] = 0. 

Multiplying (3) by i, respectively and adding, 

remembering the equations just written, we have 

KE] = (a:-a^)D. 

Thus (4) becomes 

D[EE] - D[H = [aE] KE] + [6E] ^E] +. . . + [/E] [tE]. (6) 

This is a relation between the sum of the squares of the 
residuals and the sum of the squares of the true errors. As it 
stands, however, it is not sufficient to determine [EE], since 
the quantities E occur also on the right-hand side of the 
equation; but we may overcome this difficulty in the following 
way: 

Suppose the set of observations repeated by the same 
observers with the same instruments under the same conditions, 
N times, where N is a great number. For each of these sets 
of observations we shall obtain an equation corresponding to 
(5). Let these equations be added together, and the resulting 
equation divided by N. The values of D and the a% b% . . 
/'s, ^s,. . ., T*s are the same for each set of observations, but 
the E’s, and v's differ from one set to another. Using the, 
symbol 2 to denote summation over the different sets of 
observations, we have 

g2[EE]-52M 

= ^'S[aE] KE] + i 2[&E] [,E] +. . . + ^2|yE] [rE]. 

Now if pomtive and negative errors are equally liable to 
occur, each of the sums ^2EpE, evidently tends to zero as N 
inoreasas indefinitely: and then the equation becomes (when 
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we understand the sign of equality now to indicate that the 
two sides of the equation differ only by negligible quantities) 

• + + . . . 

Now ^2 Ej2 = ~ 2 E 2 * = . . . = ^SEj,* = the square of the true 

quadratic mean error, i.e. the value which we should deduce 
from an indefinitely great number of observations. 

Therefore, denoting the quadratic mean error by e, we have 




" (^ 1^1 + ^ 2^2 + • • • + + . . • + {f’Jx + . . • +/« T #)€^ 

= I)m€^ 


so 




1 

s - vh N 


In this equation € is the true value of the quadratic mean 
error to be feared in a determination whose weight is unity, 

and is the mean value of the sum of the squares of the 

residuals. Since the true value of is unknown, we are 

obliged to be content with that which is furnished by the 
unique actual set of observations: so for the most probable 
value of e we have the equation 

.*= M.. 

s-m 


This is Gauss’s expression for the quadratic mean error e to be 
feared in a determination whose weight is unity; s denotes the 
number of equations of condition, and m is the number of the 
unknowns x, y, . , f. The quantity 

is therefore well adapted to measure the precision of the given 
set of observations. 
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125. Evalnation of the Sum of the Squares of the 
Residuals.—We shall now show how [®®], the sum of the 
squares of the residuals, may be expressed in terms of the 
coefficients in the equations of condition and normal equations. 
We have 

[®*] “ (®i®o ^1^0 + • • • + ./i^o ~ (®»*o + • • • ~ ^•)* 

= [aajxg* + [+. . . + 2[aJ]a:;^o 4-. . .-2[aw]a:j-. . . 

+ [»«]. (1) 

and we know that is the minimum value of the quadratic 
form 

[oajr* + [66]y® +. . . + 2[a6]a:^ +. . . - 2\an\x . . . + \nn\. 

From the general theory of quadratic forms we know that 
the minimum value of this last form (which we know to be 
essentially positive, since it is a sum of squares) is 




[ffla] [a6] . . . [af] 

[ah] [Si] ...[&/] [jTi] 

[an] [ 6« ] . . . [fn] [nn] 

[«a] [aft] . . . [a/] 

[oJ] [6J] . . . [&/] 

« • • ■ 

[«/] [¥]•••[//] 


( 2 ) 


We may also establish this formula directly in the following way. 
We have already proved (§118) that 

[««]= -[an]a:o-[6»]y„-. . . - |>]«o + [««], 


and substituting for jcq, their determinantal values, we 

obtain the equation (2). 

Combining the results of this section with the equation (§ 124) 

8-m 


and the equation (§ 122) 


6* = C«. AU 


D’ 


we see that the quadratic mean error to he feared in^ the deter¬ 
mination of X is €xi 
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where 






[aa] 

[ab] . . 

. [«/] 

[an] 

[66] 

[be] . 

. [¥] 


[ab] 

[66] . . 

. m 

[6n] 

[6c] 

[cc] . 

• W] 


• 

• 

• 

• 

W] 

W] .. 

(s - 7n) 

■ • iff] 

\ M [< 

ll 

[an\ 

.]77 

[bn] . . 

• ¥f]f 

■ [>] 

[nn] 


[ab] [66] . . . [6/] 

[«/] m ... [//] 


If the original observations are weighted we must write \wad\ 
for [aa], etc. 

Ex, *—Suppose that observations of equal weight give 

X- y + 22 = 3 
3aj 4- 2y - 52 = 5 
4x+ ^+42 = 21, 

while an observation of weight ^ gives 

- 2 a; + + 62 = 28, 

For the last equation we substitute 

--a;+ 3^ + 32 = 14, 

and the four equations are now of equal weight. 

The normal equations are 

6a:o+15yo + *’o=’^® }-, 

3 /(,+ 64»o=107j 

, . , . 49164 2617 12707 

which give 19899. yo= *0=-^. 

or a;Q= 2*470, i/q= 3-551, 2 ^= 1*916. 

The weight of the determination Xq is where 

1 Ajj I 1 54) 809 

19899’ 


809 ’ 

^ , 737 2211 

and similarly Wy = -gj-, Wg = — • 

* This example was used by Gauss himself as an illustration of the Method 
of Least Squares. 


Wx D 27 6 0 

6 15 1 

0 1 54 

19899 
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The leeidnals ate 

4960 1320 1880 1400 

”19899’ ”19899’ 19'899’ ”19899’ 


and therefore the euiu of the squares of the residuals is 




1600 

19899’ 


so the quadratic mean error to be feared in the determination of x is 
[v*] 1294400 

where € 3 . 2 =—^ = ^9899)*’ for €y^ and 

126. Other Examples of the Method. —The Principle of 
Least Squares is applied in the first of the following examples 
to a problem of curve-fitting, and in the second to a geometrical 
construction. 

Ex, 1 .—Find the values of the constants a, 5, c, which nearly satisfy 
the equation 

h c 

/(a?) = a + - + -2 


for values of x between q and p, where f{x) is a given function. 
In this case 




is to be made a minimum, so differentiating with respect to a, 6 , and c, 
•we have the three equations 


ajf(fa + 6jf 


From these the values of a, h, c can be determined. 

Ex. 2 .—The position of a point in a plane is determined as the inter’' 
section of several lines furnished by observation. Owing to errors of observa^ 
tion the lines are not exactly concurrent. To find the most probable position 
of the point.* 

Denote by d^ . , dn the distances of a point of the plane from 
* d*Ooagne, Joum. de vicoU Pol. cah. 68 (1898), p. 1 . 
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the n given lines; let Aj, Ag, . . he the weights attached to those 
lines : then the required point is that for which 

Ai^i^ 4* Agdg^ +. . , + 

is a minimum. This point may be called the centre of least squares of 
the system. 

The centre of least squares is the point for which the function 

"KittiX + biy + Ci)* 

i-i a^ + h^" 


is a minimum : so its co-ordinates are given by 


0S 

dx 


0 , 


as 


= 0 , 


« Aiaf(ofa; + % + ef) ^ ^+ % + «<) „ „v 

“ h ..= ' - 

Now if 0 is any point whatever of the plane, and Oj', Og', . . 0^' are 

the images of 0 with respect to the given lines, the centre of gravity 0' 
of particles of masses Aj, Ag, . . ., A^^ at 0^', Og', . . ., 0,/ may be called 
the symmetric harycentre of 0 with respect to tlie system of lines. 

If 0 be (X, Y), the symmetric harycentre 0' of O has the co-ordinates 
(X', Y'), where 

V / V 2 « kiatiaiX -H hiY -f ct) 


2 ”Xihi{aiX + hjY + Cj) 

■"L^rr ’ 


( 2 ) 


where L = Aj 4-. . .4- A,i. 

Now suppose the origin taken at P, the centre of least squares, so tnat 
by (1) we have 

V . Q 


and suppose the direction of the axis Pa; chosen so that 


V 

a\^+h^ 


0 . 


Then the equations (2) become 


X' = ( 


and since | 


these may be written 

(d311) 
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Let 0^ be the symmetric liarycentre of O'. 

The co-ordinates of 0^ are X'' = and therefore the line 

OO'' pasm through P. Moreover, the lines PO and PO' are equally 

. ^ ^ PO' PO^ 

inclined to Pa;, and =s /* = 



Therefore the triangles PO'O'' and POO' are similar, and O'P makes 
with O'O'' an angle equal to the angle O'OO". Thus the centre of least 
squares P is the intersection of 00" with the line O'P which makes 
with O'O" an angle equal to O'OO": or otherwise expressed, the centre of 
least squares is the intersection of 00" with the tangent at 0' to the circle 
OO'O" 

127- Case when two Measured Quantities occur in the 
same Equation of Condition. —We shall now consider the case 
in which the quantity »in an equation of condition 

ax + by + cz + . . .+ft = n (1) 

is not itself a measure derived from a single observation to 
which a known weight is attached, but is a known function of 
two meeisures p and q derived from different observations to 
which weights Wp and Wg are attached. It is required to find 
the weight which must attached to the equation (1). 

Let the expression for n in terms of p and q be 

n = i>{p,q). 

Then a small error e in and a small error /tin q give rise 

. d<f> 

to an error ^ + 
op 

given by the equation 

\dp/ Wp \dq/ 


d<h . 
^m«; 


and therefore the weight Wn of ^ is 
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The equation (1) is now to be treated as if n were a 
measure derived from a single observation having the 
weight Wn- 

128. Jacobi’s Theorem. —It was shown by Jacobi* that 
the values of the m unknowns which are obtained from s 
equations of condition by the Method of Least Squares may be 
derived also in the following way. Take any m of the equations 
of condition and solve them as ordinary algebraic equations, 
obtaining (say) a value 



(where A^ and are determinants) 


for the unknown x. We can select the m equations for this 
purpose in p ways, where p = and thus obtain p values for 
X, say 


Aj Aa Ap 

Then the value of x given hy the Method of Least Sqtiares is 

_ApBp 

+ + , . . + Bp2 ‘ 


This result is an immediate consequence of Cauchy’s well- 
known theorem on the equivalence of the two forms in which 
the product of two arrays can be expressed: thus in the case 
of s = 3, m = 2, what is to be proved is 


a^x^ + a^x^ + a^x^ 
+ \x^ 

a^ +a^^ +a^ 


&2^+ ■ 


h 3 

^3 


+ af)^ + af)^ 


®A 

^A 

4 . 

®A 


4 . 


aA 


^'2^2 


®A 

aj >3 

1 


0363 


aA 

af)^ 


^363 


«2^2 

«3^3 


which follows at once from the theorem on the product of two 
arrays. 


♦ Joum. fUr Math, 22 (1841), p. 286. 
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Eon ,*—^Consider the system of four equations used by Gauss, and 
worked out in Ex., § 125, above, viz. 


as- 2/ + 2»= 3, 

(1) 

3a! ^ 2y ““ 5z — 5, 

(2) 

4a: + T^ + 4i<} = 21, 

(3) 

- x + 3y + Sz=^14. 

(4) 

Now equations (1), (2), (3) alone would give 



_115 _65 

36’ ’•'“35’ *“35’ 


the numerators and denominators being the exact values of the de¬ 
terminants, thus, 


1-1 2 


3 2-6 = 

35, etc. 

4 14 


Equations (1), (2), (4) would give 


122 167 

93 

*=- 47 -. 

^-- 47 * 

Equations (1), (3), (4) would give 


78 113 

67 

*“33’ 

^ = 33* 

Equations (2), (3), (4) would give 


- 304 - 444 

-236 

®“-124’ ^“-124’ 

^“-124’ 


(90 X 35) + (122 X 47) + (78 x 33) + (304 X 124) 

362 + 47 *+ 33 *+ 124 * > 

49164 , , 

=-, ai before. 


129. Case when the Unknowns are connected by 
Rigorous Equations. —It frequently happens that the un¬ 
knowns X, y^z, . , .,t are not independent, but are connected 
by rigorous equations: for instance, if a?, y, z represent the 
three angles of a triangle, they are connected by the rigorous 
equation x + y + z^ir. In order to discuss this case we shall 
suppose that the unknowns are given as before by a set of 
linear equations derived from observation 


a^x + \y + . . = 


* Qlaisher, Mmth, Not. 40 (1880), p. 600. 


(1) 
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(where w, are the observed quantities), together with 

p rigorous equations 

<t>i (®. y, • • ;t) = O' 

0=0 ^ 2 ) 

0 = 0 . 

We shall suppose that (by multiplying each of the equations 
of condition by the square root of its weight if necessary) the 
equations of condition have been rendered all of equal weight. 
Then, as in the proof of § 125, we see that the most probable 
values of the unknowns are those which make 


E = [a«]i6'®++ . . . + 2[a&]a?y + . . .-2\an'\x-. . . + 
a minimum, subject to the conditions (2). We must have 
therefore 


OPjj OHi , A 

where dx^ dy^ . . are subject only to the conditions 
+ + . . . + ^d< = 0 ’ 


and therefore the unknowns are to be determined from the m 
equations 


dx 


+ A, 




4. A rr* 4. 

dx ^ dx 







-^1'- 


= 0 


3E » 


+ Aj- 




dt 



= 0 


(where A,, A,,. . Ap are unknown multipliers), together with 
equations (2). We have therefore (m+p) equations to de¬ 
termine the (m+p) unknown quantities a;, y, A^,Ap. 


Ex. — The measures of the four angles of a plane quadrangle are 
a, )8, y, 8, with weiglUs g^, g^, g^ g^ respectively. Find the most prohahU 
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valms of the angles^ and show that the weight of the mine found for the first 
angle is 

OMb + W4 + W4 + W3.^4 

so that when all the weights are equals the weight of the final value of an 
angle is ^ the weight of an observation, 

Tjet the angles be 6^ 6^, 0^, The equations of condition, reduced 
to unit weight, are 

V'/3^3='«/w> ‘j94Pt='j9t^ 

and the rigorous condition is that 

+ ^2 d“ ^3 "i" ^4 “ ^TT. 

We have therefore to make 

9l(0l - «)» + ff2(^2 - Pf + »3(«8 -yf + 94(^4 - 8>* 

a minimum, subject to tlie last condition. We have therefore 

9l(^l - + 9^&2 - + 9d^3 - = 0 . 

where the differentials are subject to the condition 
dOj^ + d$2 + dO^ + d6^ =0,. 

and therefore 

- a) = -p)=‘92(^3-y) = 94.^2 - 

We thus obtain 

0 ga!/3g4(« + ^ + y + S-2^r) 

^ “ 9 ^^ 2 + 9 i 9^2 + 91^4 + 9 i 9^3 

0 9jm+MA±M^a-S^(P + y + S-Z^\ 

^ ^929394 ^92939* ' 

and similar expressions for the other angles. 

Denoting the weight of by Wp we have at once 

1 -f 9i<9203 + 9s92 + 929^ Y^ + ( M^Y(1 +1 + 1-'] 

Wj \ ^29394 f 9i \g^ gj’ 

=^^^^^^®{9i(9a98+9894+ 9492 )+ 929s?4}. 

9293 + 9894 + 9492 
2929394 ’ 

o, W ^ 929894+9i9894+9t9a94 + 9i9^8 

^ 9293 + 9894 + 9492 ’ 

wUdi is t]ie Tsqnired result 
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130. The Solving Processes of Gauss and Seidel.— 

When the number of unknowns is great, the algebraic methods 
for the solution of the normal equations, which have been 
described in §§ 117-120, become exceedingly laborious: under 
these circumstances, the normal equations may be solved by 
a method of successive approximation in the following way. 

Writing the normal equations 

+ ^122^ + • . . + = 

«ln'^ + «2ny + - • = 

we first assume for x, y, .... t any system of values. Since 
in the normal equations the diagonal coefficients are sums of 
squares, whereas the non-diagonal coefficients are sums of 
products of which in general some are positive and some are 
negative, it is most often found that the diagonal coefficients 
are larger than the others, and therefore the corresponding 
terms are most important: so we therefore generally take as 
initial values for x,y, , . t the numbers 


A A 

^22 ®wim 

respectively. 

With these assumed values of a:, y, . . ., t, we calculate the 
quantities 

Ni = + ^igy +. . . + 


and take 


^11 


We can now assert that by swiding Aa: to the assumed value 
of X we are improving it: for if we denote by Q the sum of the 
squares of the residuals when the assumed values are put for 
x,y,., ., ty we have 

+ 2aijnXt 4 *... - 2CyX «... + 


’^{aj^x + aj^ + . . . + aim#-Ci)V«u 

+ terms depending only aa.y,x, . , ^ t. 
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If in this we replace a? by a; + Ar, without changing y, 
the effect is to destroy the term {a^x +. . . + in Q 

without affecting the other terms: that is, the effect is to 
diminish Q. Now the set of values of x,y, . . t which we 
wish to obtain are the set which make Q a minimum: and 
therefore if we say that one set of values of x,y,,..,t is an 
improvement in another set when it corresponds to a smaller 
value of Q, we can say that by adding Ar to Xy without changing 
the values of y, . . ty we are obtaining an improved set of 
values for the unknowns. 

Now with the improved set of values of y, . . ^ we 

calculate the quantity 

N2 — + a-gjg?/ + • • • + Cl 2 mP ^2* 

N 

and take Ay =- K 

In the same way we can show that to add Ay to the assumed 
value of y is an improvement. Proceeding in this way, we 
improve each of the values y, • . ^ in succession, and then 

return to x. The process, which is due to Seidel,* may be 
stopped when the residual N^s are sufficiently small. In 
common with all iterative methods of approximation, it has 
the advantage that an error of calculation continually corrects 
itself in the subsequent steps of the process. 

Ex ,— Let us solve by this process the normal equations of Gauss’s 
original example, namely 

( 27a;+ 6y =88, 

6a;+ 16y + « = 70, 
y + 54» = 107. 


We take as a fist approximation 



*= »|. =3 roughly, 
tt roughly, 

* = = 2 roughly. 

Then 

Ni=27® + 6y-88 = 28, 

and 

A*- - -||= -0-86. 


Milnch, Ahh, 11 (1874), Abt. 8, p. 81. 
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Thus the improved set of values is ac= 2*15, 5, «*= 2. These give 

N 

N2 = 6a!;+16y+ 70 = 19*9, and therefore Ay= — —1*33, so we 

now have as = 2*15,1/ = 3*67,2 = 2, and N3 = ^ + 542- 107 = 4*67. Thus 
N 

A*= -5^= -0-086, giving a= 2-16, y = 3-67,*= 1-914, and Nj= -8. 

Repeating the process, we have 
N 8 

Aa!= - ^ = ^= 0*296,giviiiga; = 2*446, y = 3*67,»= 1*914, 1*640 ; 

N 

Ay= -0*1093, giving 05=2*446, 1/= 3*661, «= 1*914, 

N3=-0*083; 

A N, 

A2= ■’^=0*01, giving 35=2*446, t^= 3-661, 2=1*915, Ni= -0*692; 
N 

Aa;= - J=0*022, giving 35= 2*468, ^ = 3*561, 2=1*915, N2=0*138; 
N 

Ay= - - 0*0092, giving 35 = 2*468, y = 3*552, 2= 1*916. 

These differ only by at most 0-002 from the true values. 

If Sciders process be carried out for a set of equations with 
literal coefficients, such as 

( ax + hy 
hx 

gx +fy + cz = n, 

it is readily seen that the value of x is what would be obtained 
from the formula 


X — 


abc 


I m 
h I 

9 f 



ahc he ac ah) 


c 


by expanding the last factor by the binomial theorem as an 
infinite series. 

A method closely akin to this had been communicated many 
years before by Gauss to Gerling.* It may be illustrated by 
the following example: f 

* Of. the appendix to Gerling’s work on the application of the calculus of 
compensation to practical geometry (1843), p. 386. Another very similar 
process was descriM by Jacobi, Ast, Nach, No. 623 (1846), p. 297. 
t 0. A. Schott, IKS, Coast'Survey Bep, (1866), p. 266. « 
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Let the normal equations be 

0 = 2*8 + 76af — 30y — 202 — 26t*, 

0 == — 4‘1 — 30a? + 83y ■— 25x — 28w, 

0 = — 1*9 — 20a; — 25y + 892 — 44u, 

0 = 3-2 — 26a? — 28y — 442 + 98tt. 

To ascertain which of the unknowns will probably be the greatest, 
we examine the quotients: 

,._^.-0.03, ,.«_+0 04. 


u = — 



-003. 


Accordingly we begin with y and write y = 0*04 Ay . The equations 
now become 

0 = 1'60 + 76 a ? — 30 Ay — 20z — 26 m , 

0 = — 0*78 — 30 a ? + 83 Ay — 252 — 28 m , 

0 = — 2*90 — 20 a ? — 25 Ay + 892 - 44 m , 

0 = 2*08 — 26 a ? — 28 Ay — 442 - j - 98 m . 


This gives for quotients 


a? = 


1*60 

'76 




-0*02, Ay=^ = 001, 2 = 


83 


2*90 

89 


= 0*03, 


u = 




We therefore now substitute z = 0 03 + A 2 : and proceeding in this 
way, the whole solution may be brought to the form: 


IT« 0*04 

M > 0-03 

Ay - 0 01 

*= -O'Ol 

«»=. -001 

Ax 

- -0-003 

A* 

-0 002 

AN » 0-001 

0 = 1*60 

1*00 

0*70 

- 0*06 

0*20 

— 

0*028 


0*012 


- 0*78 

- 1*53 

- 0*70 

- 0*40 

- 0*12 

— 

0*030 

+ 

0*020 


- 2*90 

- 0*23 

- 0*48 

- 0*28 

+ 0*16 

+ 

0*220 

+ 

0*042 


+ 2*08 

+ 0*76 

+ 0*48 

+ 0*74 

- 0*24 

— 

0*162 

— 

0*074 



The operation is now completed if we are satisfied with two places 
of decimals, and the first unknown quantity isa?+ Aa?-|- A®a?+. . .or 
a? = -0*013. 

Similarly y = 0*050, 

2 = 0*028, 

M = - 0*009. 

131. Alternatives to the Method of Least Sgaares.~~At different 
times various methods have been proposed, other than the Method of 
Least Squares, for dealing with problems which are commonly solved 
by that method. We shall now notice briefly some of these: 

1^. The Method of Tobias Mayer. —In the latter half of the 
eighteenth century the most plausible values of the unknowns were 
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commonly found by a method which had been published by Tobias Mayer 
in 1748 and 1760.* It consisted in arranging the equations of condition 
into sets, forming the sum of the equations of each set, and treating the 
equations so obtained as normal equations. The method is decidedly 
infeiior to the Method of Least Squares. 

2®. The Method of Minimum Approximation. —Consider a system 
of 8 incompatible linear equations in m unknowns (a>??i): 

(1) aijc + % + . . .+/it = ni (1=1, 2, . . ., s). 

Replace it by the following: 

(2) ai!B + 6iy + . . .+/it-ni==ri (»=1, 2, . . ., «), 

where the quantities may be called residuals. The name minimum 
approximation of the system (1) was given by Gocdseels If to the smallest 
value which we can assign to the absolute value of the greatest residual 
of the system (2), in older that this system (2) may be compatible. 

The problem of determining the minimum approximation of a 
system (1) was enunciated and solved by Laplace { in 1799, but his 
method involved such laborious calculations as to be in general 
impracticable Six years later, in 1805, Legendre proposed the same 
problem, in the appendix to his Novvelles Me'thodes pour la determination 
des orhites des comktes : he found no easy method of solution, and proposed 
to replace the method by that of Least Squares. 

A much better solution was given in 1911 by C. J. de la Vallee- 
Poussin.§ 

3®. Edgeworth’s Method. —Taking the data as usual in the form 
a^x-\-\y + . . = 

a^ + hgy + . . = 

where 91 ^, Wg, . . ., n, are measures of equal weight, F. Y. Edgeworth in 
1887 II proposed to define the most plausible values x, t/, . . ., f in the 
following way : x, y, . . t are to he such as to render minium the sum 
of the absolute values of the residuals, 

|a^a; + . . + + . . .+/ii<-»2l + * • • 

+ |a.a: + 6»y + . • •+/«<-«f|- 

It may readily be shown that this rule is derivable from the 
hypothesis that the law of error is of the form 

where * is taken positively in both directions. 

* Kosmographische Ncuihrichten und Sammlungen. 
t P. J. E. Goedseels, Th^orie des erreurs d*observation ; Louvain (1907). 
t Micanique celeste, Livre III. No. 39. 

§ Annales de la Soc. *%. de Bruxelles, 35 (1911), B, p. 1. 

II Phil. Mag. 24 (1887), p. 222, and 25 (1888), p. 184. 



CHAPTER X 


PRACTICAL FOURIER ANALYSIS 

132. Introduction. —For the description of phenomena 
and the solution of problems in Physios, Astronomy, and 
Meteorology, much use is made of Foui4er series, that is to say, 
series of the type 

+ tti cos 6 + flj cos 26 + cos 30 
+ sin 0 + sin 20 + sin 30 +. • . (1) 

where a^, . . . are independent of 0. 

Consider, for example, the vibration of a violin string. Let 
a stretched elastic string be fixed at its end-points and take 
the axis of x along the string, so that the abscissae of the end- 
points can be taken to hea;=>0 and x=l‘, let y be the dis¬ 
placement (in a direction perpendicular to the string), at time 
t, of the point of the string whose abscissa is oe. Then when 
the string is set into vibration in such a way that it emits its 
fundamental note, unmixed with any overtones, its vibration is 
.represented mathematically by the equation 

y=A8in^8in {Xi + a), 

where 2ir/A is the period of the note in question, X. depending 
on the mass, length, and tension of the-string, and where A 
and a are arbitrary constants. If the string is set into vibra¬ 
tion in such a way that it emits its first overtone (the octave 
of the fundamental note) unmixed with any other sound, the 
vibration is represented by 

y-Bsin^sin (2Xt + /S), 

260 
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while the second overtone is represented by 
y = C sin sin (3A^ + y), 

and so on. When the string is set into vibration in a quite 
general fashion, so that all the overtones are present in the 
sound emitted, the displacement at time i is represented by a 
sum of these terms; thus 


y = A sin ^ sin (Ai + o) + B sin —^ sin (2Xt + fi) 

+ C sin sin (3A^ + y) +. 
and therefore the velocity at time t is represented by 
^ = AA sin ^ cos (A< + o) + 2AB sin cos (2Xt + j8) 

+ 3AC sin -j- cos (3A^ + y) + . 


2jra; 


( 2 ) 


(3) 


Suppose that at the initial instant, ^ = 0, the displacement 
and velocity at every point of the string are given; let the 
displacement be and the velocity be \//{x). Suppose, more¬ 
over, that we are in possession of a method which enables us to 
express a given function /(x), which vanishes at a; = 0 and a? = /, 
as a series of the form 


^ . tto; , . 27rx , . 37ra; 

f(x) = sin + 62 sin -j- + 63 sin -j- 4 -. 


(4) 


where b^, Jg, Jg,. . . do not depend on x, but depend upon the 
nature of the function /{x). Applying this theorem to the 
function <l>{x), we should have an equation 


. . vx . 2irX . 5irX 

ij>{x) sin +|J2 sin —j-+Ps sin -j- +. 


( 6 ) 


where p^, p^, p^, • • • may be regarded as known, since the 
function <l>{x) is given. Similarly 


. . wx . 2jra! . Sttx 

i//(x) q^sm -j- + SzSiD. -j- + q^am -r- +. 


where q^, q^ qt, • • • may be regarded as known. 


( 6 ) 
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But putting ^ = 0 in (2) and (3) we have 
^(®)-»A8mo8in^ + Bsinj38in^^ + CsinyBin^^ + ... (7) 

^(®) ■» XA 008 o sin y + 2XB cos /3 sin 

+ 3XC COB y sin . . . (8) 
Comparing (5) and (7) we have 

Asina=2)i, BBin)8=iJj, Csiny-pj ... (9) 

Comparing (6) and (8), we have 

XAooso = 2i, 2XBcob)8 = Jj, SXCcosy-yj . . . (10) 

The systems of equations (9) and (10) enable us to find the 
unknown constants A, a, B, )8, C, y,. . . in terms of Si>jP 2 > 
that is to say, in terms of known quantities. Thus equation 
(4) enables us to analyse the initial data into constituents 
A, a, B, jS, . . . such that the first pair of constituents (A, o) 
gives rise to the fundamental note of the string, the second pair 
of constituents (B, J3) gives rise to the first overtone, the third 
pair of constituents gives rise to the second overtone, and so on. 

As a second illustration consider the Theory of Tides. The 
tide<generating potential due to the sun and moon may be 
expanded as a series of terms of the type 

A sin (X^ + c), 

where i denotes the time and A, X, e are independent of t but 
change from one term of this type to another. Each such 
constituent term gives rise to an oscillation of the sea, the 
oscillation having the same period 2n-/X as the term in the 
tide-generating potential to which it is due; and the height 
of the tide at any instant at any seaport may therefore 1>e 
represented as a series of terms of the type 

A' sin (X^ + e'), 

where the constant X is characteristic of the particular tide 
but is the same for all seaports, while the constants A' and e' 
are characteristic of the particular constituent tide and the 
partioulbr seaport. ‘ By analysing the observed tides at a 
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seaport hj means of a theorem similar to (4), we can find these 
constants A' and e'; we are then in a position to predict the 
tides at this port for all future time. 

A series of the type 

cos x + a^ cos 2 a; + ^3 cos +. . . 

+ sin x + b^ sin 2x + 63 sin 3a; +. . . 
is generally called a Fourier series, the coefficients a^, . 

being called Fourier coefficients: and the representation of a 
given function by means of a series of this form is called 
Fourier arholysis. The term trigonometric interpolation is 
perhaps more appropriate when (as in the present chapter) we 
are concerned only with finding a series with a finite number 
of terms which takes given values for a given finite number of 
values of the argument x. 

133. Interpolation of a Function by a Sine Series.—In 

the last article we have seen the importance, in Applied 
Mathematics, of a theorem which will enable us to analyse 
a given function into a sum of trigonometric terms. The 
problem was solved, at any rate in its simplest form, in 
1754-59 by Clairaub* and Lagrange,f who showed how to 
construct a sum of n tmgonometric terms, such as 

u{x) - sin X + 63 sin 2ic + 63 sin 3a? +. . . + b^^i sin [n - 1).'?;, (1) 

which will take given values for - 1 ) given equally-spaced 
values of the argument x ; say, 



where u^, • • •, Q-re given numbers. 

To effect this, Lagrange remarked that the sum 


. pic , qic . 2«7r . 2o'7r 

sin — sin — + sin sin +, 
n n 71 n 


. (n - l)pir . (n - l)qir 

. + sin ^ sin ^- 

n n 


(where p and q denote positive integers less than n) has the 
value \n when p is equal to q, and is zero when p is not equal 
to q. Therefore the function 

* Glairaut, HisL de VAcad,, Paris, 1754, p. 545. 

f Lagrange, Mise. Taurin. i. (1759), p. 1: reprinted CEuvrcs de Lagrange, 
i. p. 39; Mise. Taurin. iii. (1762-5), p. 258 ; reprinted (Euvree de Lagrange, 
i. p. 553. 
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oj sin^ + sin 2 a? sin +. 
n\ n n 


. + sin [n - l)ajsin 


%ip 


lias the value Up when aj = — and vanishes when = — where 
^ n n 

q is different from p : whence it follows immediately that the 

coefficients in ( 1 ) must he given by the equation 


n\ 




mv 

n 


+1^2 


2mir . In - 

-+ . . . + 7^n-ism^- ’ — }. 

n n J 


It should be noticed that the e:ipression ( 1 ) satisfies the 
condition that: 

1®. It takes the prescribed values u^^, . . Un-i At the given 
values of the argument. 

2 ®. It is periodic with period 2ir. 

3®. It is an odd function of a?. 

Suppose now that u(iv} is a function of a? which takes 
prescribed values u^, . . Un^i at the given values of the 
argument, but suppose that the function u(cc) is not periodic 
and is not odd; in such a case the expression ( 1 ) would have 
a graph agreeing more or less with the graph of the function 
u(x) between x = 0 and a?* 7 r, but the agreement would cease 
altogether for values of a? less than zero or greater than tt. It 
is important to realise that by this method of interpolation we 
can obtain an expression which agrees very closely indeed with 
a given function over a certain range of values of the argument, 
but which, outside that range of values, bears no resemblance 
whatever to the function. 

134. A more general Representation of a Trigonometric 
Series. —In the last article we obtained for the function u(ic) 
an interpolation formula which is formed of sines only. We 
shall now obtain a more general formula* which involves 
both sines and cosines, and which, moreover, enables us to 
make the best use of all the data in our possession, when we 
have more data than the minimum number required. 

Let it be required to find a sum 

Uo + ajcosic + agcos2aj + . . . + ayC0sra5i 

+ sinaj + Jg sin 2 a? + . . . + 6 rSinra 3 / ' ' 


t Bessel, KCnigO^erger BeobacMungeUt 1 Abt. p. iii. (1815). 
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which fwrnishes the best possiUe represeTUation of a fuiwtion 
u{x), when we are given that u{x) takes the values u^, 

Un-i respectively, when x takes the values 0, 

respectively: n being some number greater than 2r. The 
problem is to determine the (2r +1) constants 
a,., hy, so as to make the expression (1) take, as nearly as 
possible, the n values when x takes the values 

0, —, . . ^; so the equations of condition are 


27r 2. 2ir r . 27r 

^ + ^1 COS-+ «« COS --+ . . . + ttr COS- 

7b ^ n 7b 

, . 27r , . 2.2?r ^ . ?*. 2w 

^ 7b ^ 7b 7b 

2.27r 4.27r 2r, 2ir 

'*^2“®ii + ®iC0s-H-a-cos-+ . . . + a,.co 8 - 

, . 2.27r . . 4.27r - . 2r.27r 

+ A sin-+ 0, sin-+. . . + Oy sm-, 

7b ^ 7b 7b 


(w-l)27r r(7l~l)27r 

= + -+ • • . + «rCOS - 

, . (?l-l)27r , . r(w—l)27r 

+ by sin -'— +. . . + fer sm - 

^ 7b 71 

\ 

The normal equation for is therefore 
Wq + *1^1 + 

f, 2jr (»-l)2ir'k 

= n«^ + aj|l+ C08 —+. . . + co8 ^—^—j 

2.2t- 2(n-l)2v'\ 

+ «,{! + cos - +. . . + cos —*-— }• 

*l « n J 




+ ar|l+COS -^ + . . . + COS-^^- ^ -1 

, r . 27 r . 2.2?r . (w-l)27rl 

7b 7b 7b J 


+ 6r|8in 


. r .2ir 


+...+sm 


. r{7b - l)27r 


ll)^\ 

7b J’ 
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- ^ktr Ahv 2(n - l)fer . 

n n n 

when & is an integer greater than zero and less than n, and 

. 2fer . AJctt . 2{n — X)kir ^ 

sin— + sin— + . . .H-sin-'-- -= 0, 


this becomes 


+ V/2 + • • • + 1 — WffQ. 


The normal equation for is 


27r 2 • 27r (ti — l)27r 

t^A + WiCOS + 'M 2 C 0 S^-+. . . + i 4 n-lC 08 ^- ' — 

^ n * n n 


ZTT (n-~l)2ir'\ 

■=aJl+cos — + . . . + cos^- ^—} 

w n J 

. g.2ir ft ~ l)27r'\ 

n n ) 

, 2^r 2.27r 2.27r 4.27r 

+ a-i 1 + cos — cos-+ cos-cos-+. . • 

n n n n 

(w-l)27r 2{n^l)2ir\ 

ona '_ • nrka _'_ L _1 


, / 2jr . 2jr (n-l)2ir , (n-l)2wl 

4 6, { cos — sin — 4. . . 4 cos ^- - — sin '- - — > 

* I » « « n J 


+ . . . . 


Using the trigonometric formulae 

f, ,2ir ^47r .2(71-l)7r 1 

n n n 2 

. 2ir 2.27r 2.2tr 4.27r 

1 + COS — cos-+ 003-COS-+ . . . 

n n n n 


2ln - l)7r 4(71 - l)7r f. 
+ cos cos ^ « On 


2ir . 27r 2(71 — l)7r . 2(71 - l)7r . 

1 — Sin — +. . . + cos - — sin - ^-^ = 0, 
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this becomes 

2fl- 47r 

cos — + cos — + . 


. +W»_iCOS 


2{n - l) 7 r 
n 



The other normal equations may be obtained and reduced 

in the same way; finally we obtain the following set of values 

foT Uq, . . tty, 6y: 

1 ^ 
ttft==- 2 Wjfc 

® ^/c=0 

2 2kw 

tt, = - 2 U]^ cos- 

^ W fc=0 w 


2 2A7rr 

tty = - Z 1^1. COS- 

w fc=0 ^ 

, 2«-l . 2k7r 

fe, =- 2 wjbsm- 

^ n ic=Q n 


j 2 . 2krv 

Or-- 2 WjuSin- 

Wjfc=o ^ , 

1 12 

When r= ~n, the factor of ay in front of the symbol 2 is not 
2 n n 


( 1 ) 


The connection of this result with that of the preceding 
article is easily seen; in fact, the formulae of § 133 are merely 
those of the present article adapted to the particular case when 
the function u is an odd periodic function of its argument, so 
that u{x) = - ^(2ir - x). For if in the formulae (1) above we 
write n^2p and suppose that -Wgp.j, etc., 

with i^o = 0 , i^p = 0 , then the formulae ( 1 ) become 

ttj,= 0, ttjL=0, . . ., tty=0, 


2 f . mv . 2m7r 
= -{u. sin — + Wo sin-+. . . + sin 

^ p ^ tn r ^ 


p 


P J' 


which agree with the formulae of § 133. 

135. The 12-Ordmate Scheme. —In most cases in practice 
the number of given values w^, w^, Wg, . . . is either 12 or 24. 
Wo shall first consider the case when 12 values are given. 

Let it be required, then, to obtain an expression 

a^ + ttj cos a? + ttg cos 2 a; +. . . + ttg cos 5 a; + a^ cos 6 a; 

+ 6 iSina; + 62 sin 2 a; + . . . + 65 sin 5a; 
which takes given values w^, . . ., u^^ respectively when x 
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takes the values 0 , • 


reipeetivdy. Formulae (1) 


of the last article, written in full, are now 

12aj - + Mj + « J + Mj + + «^ + Mg + M, + + Mjl "I 

« */3 1 1 J3 *^3 

6ai = Mg + Ml • + Mg. 2 - M* • 2 - «6 • - 2' “ “6 - “? • 2 

^3 

^8*2"^ ’ 2 * 2 

„ 11 11 1 

6aj-Mg + Mi-^-Kg-g-M3-Mg.2 + «5-2 + «6 + «7- 2 

1 1^1 
“ **8 ■ 2 ~ **10 ■ 2 **u * 2 

6a3 = Mg-Mg + Mg-Mg + Mg-Mij 

c 11 11 1 

6a4 =■ Mo - «1 • 2 “ **a • 2 ■*■ "* " **« ■ 2 “ “«■ "2 ■ 2 


“ Mg • 2 + Mg — Mjg • 2 Mj 


fl V3 1 1 . ^3 V' 

Sa# “ Mg - Ml. ^2 + “a • 2 “ “o ■ 2 ‘ T ■ ' 2 


1 

11'2 
^/3 


1^ 1 
*** ‘ 2 ^ ’ 2 


^/3 

*n-V 

12a, «= Mg - Ml + Mg - Mg + Mg - Mj + Mg - atg + Mg - Mg + a«ig - Mu 

1 V3 ^ , V3 1 1 »/3 

“ Ml. 2 + Mg. + Mg + Mg • + Mg • 2 - M, • 2 - Mg • ‘*2 


73 


^-“u-2 


73 

2 


6ig - Ml. Y + Mg . ^ - Mg. V - 


,j/3 

2' 


73 
2 ■ 


+ Ma 


73, 73 

+Mg.^ 

73 


73 73 


6Jg = Ml - Mg + Mg - Mg + Mg - Mu 

», 73 73 73 

66g = Mi. i^-Mg. -2- + «4*^' 


73 73 

It. • TT- + M, • 

*» 2 1 2 


73 73 73 

“ **e ’ " 2 ” ^ **10 ’ "2 **11 * 2 

1 73 V3 1 1 73 

6*s“«i-2““a*X'''“»“**« 2 ■^“®’2““i'2'^“*‘ 2 

73 1 

— Mg + Mig • 2 ~ Mil' 2 


( 1 ) 
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“8 + «io = »a 
«3 + «9 ='»8 
“4 + «8 =®4 

+ =®5 


“l “ “u “ 

=tt>3 
“ 4 -“8 =«'4 

«g-«7 =^5 


the equations take the simpler form 


12^0 = + ®a + Dg + + ®s + «, 

. J3 1 1 V3 

6rti = M„ + ®,-^ + ®3-2-V4-2-'»6- 

- 11 11 

6«2 = % + ®i-2-®a- 2-'»3-®4- 2 + ®«‘2'*'“* 

6a3 = Mo-®3 + ®4-Mg 
« 11 11 

6a4 = M, - ®1 • 2 - - ®2 • 2 + ^3 - ^4 • 2 “-^6 • 2 


6«s 


V3 1 1 V3 

“o-^* 2 +®8’2"®*“2 ®»’ 2 


12a, = M, - ■»! + «8 - ®3 + ®4 - ®5 + '^8 

1 V3 JZ 1 

66 j = • 2 + Wg • ^ + W 3 + w, • Y" + “’s • 2 

V3 V3 ys ^3 

66a = Wi-^ + W2*^-w,.^-W5-2- 

6&3 = Wi-W3 + Ws 

2 


-Wk 


V3 


e&4=Wi*-^ - w,- 

/SI 1 -73 ^ n/3 1 

665 - Wi • 2 - W3 • Y + «’8 -'^4 • %-+ ^’s • 2- 


If we now write 


) + «0 
l’l + ®6=Pl 
«8 + «4 = P8 
a’3“P3 

«>, + W4=>r8 

Wa-r, 


^0 ^6 ” ?0 
®3-®4=28 

^8 “^ 4 “ *8 
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the equations take the still simpler form 


l2ffo=jPo+f»l+i’2+f'8 

-ys 1 

6«i = 2'o + ?i* 2 

6a3 = <?o-?8 

6a4=2>o-f»i'2“.P*-2'^l’3 

6«6 = 2o-?i-^ + ?*'2 
12a«=i’o-2»i+i’*-f’3 

66,=*,.^ + 8,-4^ 

6/^3 = 1\ — ?3 

^/3 

— Sj • • 2 ^2 ’ 2 

665 = 7*j • 2^2 * ^8 


Now write 

1 I 1 L 1 7 1 

2 P 1 "" ~ 

^/3 7 73 Ji ^ 

—2i — ~ 2 2 

(Note that ^“0’366 = 1--^-^, which enables the 
IS 

multiplication by ^ to be performed mentally.) 

Then the equations may be written 


12aQ^Po+P2+Pi+PB 
12a^^Po+P2-Pi-Ps 
6^1 “ ?o + ^2 + 
6a6«?o+^a""^ 
6a4=2?o+2^3-,^i-^2 
W2--Po + *i-jP3“*2 


66j * + 7*3 + Wg 

65g = mj + r8-TOg 

6ff3=?o-y* 

6&j = «i + m2 

6h^=n^-n^ 

6J3 = rj-rj 
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These equations give the coefficients 65 in a con¬ 

venient form; the qomputation may conveniently be carried 
out on a printed sheet of which the arrangement is shown in 
the accompanying specimen: * 

Ex, 2 .—Given 


’*0 

Wi 

W2 

Ws 

W 4 % 


W 7 

^8 

W 9 

W,o 

«u 

0 

0-262 

0-524 

o 

00 

Oi 

1.04711-309 

0 

-1-309 

-1-047 

-0-786 

-0-524 

- 0-262 


express u(x) as a Fourier aeries, 

[Ans. 0*977 sin x - 0*453 sin 2a; + 0*262 sin 3a; - 0*151 sin 4a; 

-f 0*070 ^in 6a^] 

136. Approximate Formulae for Rapid Calculations. —In 

many cases the harmonics above the third {i,e. those with the 
coefficients ^ 4 , 64 , may be neglected, the function 

being capable of representation with sufficient accuracy by 
an expression 

u{x) = aQ + cos x + 2x % cos 3a 
+ 61 sin a? + 62 sin 2x + \ sin 3a? 



It was pointed out by S. P. Thompson f that when this is 
the case we can calculate the coefficients by simple averaging 
of the data without any multiplications, in the following way: 

First, as we have seen in equation ( 1 ) of §135, the co¬ 
efficients flg, Jg are given by the equations 

«^0=-f2K + ^l+^2 + '*^3 + ^4 + ^5 + ^6 + ^^7 + ^8 + S + ^10 + %M2) 


«3 = -^ (“0 - + “4 - ’*6 + - «lo). 

^8 = ^K-'«3 + “6-'“7 + “9-'“ii)- 


(3) 

(4) 


Next, we have, by putting x successively equal to 90° and 270“ 
in equation ( 1 ) above, 

«3 = " o -«2 + ^-M 
whence , % - «»= 2(ii - Sg), 


* The computation form has been designed by help of the suggestions 
derived from many writers, among whom particular mention should be made of 
0, Runge, Zeits, fUr Math, u, Phys, 48 (1903), p. 443, 

t Proe, Phys. Soc, London, 23 (1911), p. 334. ^ 
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and therefore 

6 l — J (W 3 tij) + Jj. 

(6) 

Next, putting 

X equal to 0 ° and 180° in ( 1 ), we have 



«0 = «0 + «l + ®2 + « 8 \ 

(7) 


«6 = » 0-«1 + « 2 -«sI’ 

whence 

«o-««= 2 (ai + a3). 


and therefore 

«i = H“o-“e)“«s- 

( 8 ) 

Equations (5) 

and ( 7 ) now give 



«2 = i(«o -“8 + «6-«9)- 

(9) 


We have still to find For this we shall suppose that the 
complete graph of the function u(x) is known, so that we can 
read off the ordinates at anj point on it; let us read off the 
values at a:=45,° 135°, 225°, 315° and call them ua>us,uiTe- 
epeotivelj. Then, putting » = 8 in equation (1) of § 134, we have 

T 1 I - . ^ 

= i ^ M* sm-n 

fe=0 " 

or + (10) 

Hquatiom (2), (3), (4), (6), (8), (9), (10) give the coefficients 
Oj, ttj, Oj, Jj merely by forming averages of the measured 
ordinates of the graph of u{x). 

Ex,—To find an approximate formula for the Fourier series which 
represents the following observations: 


«0 

1*1 

i*a 

% 

«4 



f*7 

1*8 

1*9 

WlO 

Wjl 

2>714 

3-042 

2*134 

:i-273 

0-788 

0*495 

0-370 

0-540 

0*191 

-0-367 

-0-437 

0*767 


Forming the sum of the entries, we have 


12aQ=:u^j + Wi + W2 + * • • + % 

= 11-520, 

80 a 0 = 0-960, 

and we now form the following sums : 

6 a 3 =Wo ““'**2 + ^ 4"’^6 + ’^ 8 “^io» whence a3=:0*271 
e &3 = - «3 + Wg - + Wj - Wji, whence 63 = 0-1 

= + whence 61 = 0-916 

= 1(^0 *" whence = 0-901 

4^2 = «o^ whence a^ — 0-642. 

Finally, forming a graph of the function and reading off the ordinates 
u^ corresponding to the arguments os = 45**, 135^ 226^ 316** 
respectively, we have 

462 = — 1*3 + Wg — 

= 2-36, 

ft2=0*69 (approx.). 


whence 
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Thus an approximate formula for the required Fourier Bcries is 

0*960 + 0*90 cos X + 0*54 cos 2x + 0*27 cos Zx 
+ 0*92 sin X + 0*59 sin 2a5 + 0*1 sin Zx. 


137. The 24-Ordinate Scheme. —We shall next consider the 
case when 24 values of u(x) are given, corresponding to £c«=0, 
15°, 30°, . . ., 345°. Denoting these by u^, u^, . . ., the 
problem is obtain an expression 

% + cos x + a^ cos 2x + . . . + cos 12x. 

+ sin a? + Jg sin 2u; +. . . + b^^ sin llx 

which takes given values u^ ...,^^23 respectively when x 

takes the values 0, . ., respectively. 

If we form the sums and differences of the u% thus: 


* ^0 ^1 ^2 ^3 • * • ^11 "*^12 

^^23 "*^22 ^21 • * • ^13 

Sums Vq v^ v^ v^ ... -yjj v^^ 

Differences ... 

then the formulae (1) of § 134 applied to this case may be 
written 

24^0 = Vq + + ^*^2 + ^8 + ^4 + + ^6 + ^7 + ^8 + ^9 + ^10 + % + ^12 

12a^ — Vq + v^ cos 15° + 7;2 oos 30° + v^ cos 45° + v^ cos 60° 

+ v^ cos 75° + Vq cos 90° + v^ cos 105° + v^ cos 120° 

+ Vg cos 135° + VjQ cos 150° + % cos 165° + v^^ cos 180° 


12&1 = sin 16° + sin 30° + w^ sin 45° + w^ sin 60° + w^ sin 76° 
+ Wq sin 90° + W.J sin 105° + sin 120° + sin 135° 

+ sin 150° + sin 166° 


Now form the sums and differences of the 7?*s thus: 

% % ^4 % % 

^12 % ^10 % % ^7 

Pc Px Ps Ps Pa Ps Ps* 

So ?8 ?3 ?4 ?6 


(d 811 ) 


10 
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and form the sum and dilfeiences of the w’i thus: 


Wj Wg Wj W^ Wg Wg 

Wli Wm W, Wg Wg 

’•l »-2 ^3 U H 

^2 ^3 ^4 ^6 

Then the equations become 

24ao=l)o +l’i +!»* +l’s +^^4 + J’s +1»6 

12 aj =• Jo + 2 i ?2 + ?8 + Si 

+ C08 VS" 

12 a 2 =Pq+P i cos 30°+^)j cos 60° +^>3 cos 90° + JJ 4 cos 120° 

+p^ cos 150° +|>g COB 180' 

12 a 3 = jg + COS 45° 4 cos 90" + q^ cos 135° + q^ cos 180° 

+ jg cos 225° 

12 a 4 =^g + Pi cos 60° +P 2 cos 120° +P 3 cos 180° +p^ cos 240° 

+^g cos 300° +pg cos 360° 

12 ag=Jo + 2 i cos 75 ° + q^ cos 150° + q^ cos 225° + q^ cos 300° 

+ jg cos 375 ° 

12rtg=2)g+2>iC08 90° +^gCos 180° +JP 3 COS 270° TiJgCos 360° 

+p^ cos 450° +|?g cos 540° 

12ag=?o + 2i cos 105° + jg cos 210° + q^ cos 315° + jg cos 420° 

+ ^g cos 526° 

12fl,,=»n +P\ coa 120° + »g cos 240° + Wg cos 360° 4 ». cos 480° 

4-^g cos 600° 4-cos 720 ° 

12a, == 2o + 2i C03 135° 4 - cos 270° 4- q^ cos 405° 4- q^ cos 640° 

4 - jg cos 675° 

12aj, ^Pft+Pi cos 150° 4-j>g cos 300° 4-^J3 cos 450° 4- p, cos 600“ 

4-2)g cos 760° 4- pg cos 900° 

12a„ * Jo 4- jfj cos 165° 4 - j, cos 330° 4 - jg cos 496° 4 - j, cos 660° ^ 

4 - jg cos 826°»;' 

24ajg=j>o +Pi cos 180° 4-2>g cos 360° 4-2>3 cos 640° 4-^?4 cos 720° }i' 

4 -j>g cos 900° 4-p>g cos 1080'', | 

126j=Tj sin 16° 4- rg sin 30° 4 - rg sin 45° 4- sin 60° 4 - rg sin 76° 

4- rg sin 90° 

] 26g <= «g sin 30° 4- «g sin 60° 4- *3 sin 90° 4- 84 sin 120° 4- Sg sin 160° 
I 2 J 3 =rj sin 45° 4- rg sin 90° 4- rg sin 135° 4- rg sin 180° ( 

+ rg sin 225° 4- rg sin 270° | 
1264 >■ 8| sin 60° 4- Sg sin 120° 4- Sg sin 240° 4 - ^ sin 300’ I 
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126g — rj sin 75® + rg sin 150® + sin 226® + sin 300® 

+ 7*5 sin 375° + 7*5 sin 460® 
1265 =s sin 90® + 53 sin 270® H- sin 450® 

1267 « rjL sin 105® + sin 210° + 7*3 sin 315® + sin 420® 

+ rg sin 526° + sin 630® 

126g = Sj sin 120® + sin 240® + sin 480° + sin 600° 

12 \ = 7*1 sin 135® + sin 270® + sin 405" + sin 540° 

+ 7*5 sin 675° + 7*5 sin 810® 

126io = Sj sin 160° + sin 300° + S3 sin 460° + s^ sin 600° 

+ S5 sin 750® 

126^ = 7*1 sin 165® + sin 330® + 7*3 sin 496® + sin 660° 

+ rg sin 825° + sin OOO"" 

Now form the sums and differences of the jp’s, thus: 



n 

Pi 

Pi 

P3 


Pb 

Pb 

Pb 


Sums 

Differences 

h 

Wlfl 

\ 

h 

~h 


Then we have 

24^3 = + ?2 + h' 

12a^ = ttIq 4 - 772jl cos 30® + cos 60°, 

12n:.4 = /q + ^1 h 120° + ^3 cos 180®, 

12^15 = 77I3 + TTlj cos 90° + 7^2 cos 180®, 

I2a^ + cos 120® + ^2 cos 240® +cos 360®, 
12 ^j3 = ttIq + cos 150° + cos 300°, 

24^^12 “ ^0 180® + ^2 cos 360 +cos 640 , 

24^3 = ^3 + + ^2 

12^4 = ^0 2^1 2 ^* * 

12^3 = ?3 “ 2^1 — 2^2 ^ 3 > 

24^12 ~ ^0 *” ^2 ” ^ 3 * 

n/3 1 

12«2 + 

12a3 =7713-7712, 

.0 n/3 . 1 

12^10 ^0 *” 2 ^22* 
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Next foim the sums and differences of the s’b, thus: 


«8 


Sums kg 

Differences ^2 

Then we have 

'126j = sin 30” + kg sin 60* + sin 90®, 

1264 = sin 60” + Tig sin 120°, 

126, = Aj sin 90° + kg sin 270°, 

126g = Jij sin 120” + sin 240”, 

126i, = Aj sin 160” + A, sin 300” + ^ sin 450”, 

\2bg^k,.l + kg.^ + kg, 

121)^ =^1 — ^ 3 , 

or i = ^1*2” ^'2 * ^ ^3» 

v/3 

2 ^ '•'2' “2’' 


1264 = + TO, 


Now write 


1 07 

12b^ -rii-^-712-^2-. 


2^1 ” » 2^2 > 2^^^ ~ ’ 


Zh —Jc^ 

~ Tt'i > 




Then the equations become 

24aQ = (/q + /g) + (?i + /g), 

24«i2 “ (^0 4) “ (^1 + ^3)> 

12^4 =(^o + V)-(y + y. 
i2a3=(?,+g-(v+?;), 

12a2 =(mo + mg') + m/, 

12rte = ^0 - 

12&g =(*/ + *3)+V. 

12 & 1 Q = (^1 + — /Cg I 

1 2 6 3 = 

1 2 6 4 “V + Wg', 

« 1268 =7l/-Wg^ 
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In order to calculate the coefficients with odd suffixes we 
write 45* - 30® for 15°, etc., and so obtain 

12ai = (?o + %) + 2(52 “ 52) "4 (?2 Si)’ 


12<i, 


V^® 72 / V2 J2' 


V2’ 


12«5 !7o J®2 2( ■ %, ■*■ ^* ■*■ ^*2) ■'■'2(52 52)’ 


72 


V2^ 


^>/2 




1 2S = ?o + +1(72 - 72) + ^ ( -32 ~ ■ S' 


72^ 


72 


72 ^ 


12«, =?o-7^ + ;^-?4 + 7|. 


12a, 


-a -3? +l(_ h +a +3^\ + J-^(-3y. + a -3&\ 
ii-Jo ^2 2V 72 72j 2 V 72^^* 72/ 


72 


72 ;" 2 V 72 


72> 


Now write 




72 


= 7i. 


.23 = « ' 26_ = o ' 

72 23 . ^2 ^®’ 
1 *73 

?1 ?6 2^4 “?4> 


2l 25 ■” ^2» 2^2 “* ^2 > ■” > 

“^^2 “A» ?4 ■^?2 

” h ^fv 9'4 ■" ^2 “ ^2* 

Then the equations become 

12(ii =?o + ^i + 9^3+/n 
12 «u = ?o + ^i-? 3 “/p 
12 rt 3 = (Zo *“ ?4 ^2 “ ?8 » 

12^3 = ?0 ?4 "" ^2 ?3' 

12 ag = 5^0 ^2 ■” ?3 

^12^7 = !Zo ^2 ?3 ""/2* 

We have also 

12Jj = (rg + ^) + 2(-^ + ^ + rj)+^(^ + ^ + r4), 


126 =-l-4.r 4.78.__i-_f 

^ * 72 ^ *^72 72 *’ 


125, J 2 + 2(72 72 ■'■ ^*) ■*■ ^ (72 ■*■ 72 ■ ’’®)’ 


72^72 
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^2 72 


126o 




.-n + 


IS _ 


+ r«, 


®“^2 2-^2 ^^2 •’ 

i 2 &„ = - ^6 +j 2 2 ( ~ 72 “ ’’*) ■*■ *2 ( 3 ^ 'j2 ~ ’**)' 

r. 


Put 


V 2 ./2 *’ 72 ®’ 


'•/ + »'6=^1. = = r3'-A,'=y,. 

ri-r^ = K 1^2 =<. ^^4 “’*4. »'6 + »'2'“y2' 


Then the above equations become 

( 126, = (;-, + V) + (;-2 + a 

126n=(.;i + V)-0‘2 + <). 

126s=(A3 + r3) + 0-3-rg), 

1263 = (Ag + *’ 3 ) ~ (*’3 “ ^e)> 

i2^=(A,'-y,) + C;2-0. 

126, = (A,'-yj-C;g-r;). 

The calculation is made on a computing form arranged 
as shown in the sheets inset: 

Eq^ 2 .—Fijid the Fourier expreseion for u(0)y given 


Wo 

Wl 

Wa 

W 3 

W 4 

Wo 

We 

W 7 

Wg 

W 9 

Wio 

Wil 

300 

401 

373 

241 

73 

-66 

-100 

-66 

0 

41 

27 

-34 


«12 

WlS 

Wl4 

Wm 

Wjo 

W,7 


Wl9 

Wao 

Wsi 

Wsa 

Was 

-100 

-127 

-100 

-41 

0 

-18: 

-100 

-207 

-273 

-241 

-100 

107 


[Answer, 100 (sin 0 + cosO^ sin 26 + cos 26 + sin 3^ + cos 3^).] 

138. Application of the Method to Observational Data.— 

1®. Adjustment to Period .—In applying the above-described 
method of practical Fourier analysis to observational data, we 
have first to find the values of the argument at which the data 
^o» ^a> • • •> ^83 taken. Suppose, for instance, that 

the observed period of the phenomenon is 185-28 days. Then, 
since ^ of 185-28 is 7-72, if we take to be the value of the 
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observed quantity at the instant we must take to be its 
value at the instant + 7*72 days, while will be its value at 
the instant + 15'44 days, will be its value at the instant 
+ 23‘16 days, and so on. 

2®. Allowance for Secvlar Change .—In many oases the pheno¬ 
menon whose variation is to be studied is not strictly periodic: 
thus if the numbers to be analysed represent hourly means of 
some meteorological phenomenon, the means for hour 0 will not 
in general be the same as the means for hour 24. This difference 
is allowed for in practice by applying a correction to each of the 
terms except that for noon. 

3®. Allowance for the use of Means. —In many cases the data 
from which the Fourier expansion is to be computed are not the 
actual values of the ordinates corresponding to the values 

2ir 

0, — ... of the argument, but the mean values of the ordinates 

n 

taken over certain intei^als. Thus if we wish to find the curve 
which represents the annual variation of temperature at a given 
station, we generally take as data the mean temperatures of the 
twelve separate months. It is evident, however, that if we were 
to calculate the curve directly from these data, taking the tem¬ 
perature on the middle day of July to be the mean temperature 
of July, we should introduce an error, since the average tempera¬ 
ture on the middle day of July is not the same as the average 
temperature over the whole month : in fact, the curve obtained 
from the means would be too low in summer and too high in 
winter, the true curve beiug external to the curve of means. 

We can deal with this difficulty by applying a correction to 
the data in the following way: 

Let Wp-i, be three successive means, each taken 

over an interval 2c; and let u^ be the true value of the function 
for the middle of the interval over which is taken, so that 
‘ is the quantity which should be substituted for as a 
datum from which to construct the Fourier representation. 

We shall suppose that the function may be represented with 
sufficient accuracy for values of the argument in this region by 
an expression 


2hx + 
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where a, b, c are constants, and where x is the argument 
measured from the middle of the interval over which is 
measured: we have therefore 


u 

p 


-a, m. 


‘ndx, udx, %+i = ^/ 

/- 8 « J-t € 


udx, m„ 

Performing the integrations, these equations give 

mp_i = a - 4&« + 13ct*, 
mp+i = a + 46e + 13c«®, 
wip =a+ C£®, 

12\ P 2 ) 24 ® 


udx. 


whence 
and therefore 




1«! I/ + 


That is to say, in order to convert the given mean into the 
true ordinate corresiponding to the middle of the interval over 
which m^ is taken^ we add a correction equal to one4welfth of the 
eoocess of over the mean ofm^^j^ and m^.y 

An alternative method is to compute the Fourier expression from the 
given means and then multiply all its periodic terms (^.6. all its terms 
except the constant Uq) by a factor which represents the ratio of the 
amplitude of the true curve to the amplitude of the curve of means. 

An Example of Harmonic Analysis, —In the accompanying example 
the data are taken from observations of the magnitude of the variable star 
RW Cassiopeiae ; * the magnitude of the variable star is denoted by m. 

The curve represented by the harmonic formula is drawn in the 
figure. The observed magnitudes of RW Cassioj)eiae are also given in 
Fig. 18 for purposes of comparison. 

139. Probable Error of the Fourier Ooefflcients.— Knowing 
the probable errors of observation affecting the data u^^Uy ..., 
we can easily calculate the probable errors of the values 
deduced for the Fourier coefiScients a^ hy ... . For 
(§§ 89, 94) if € is a linear function of u^, Uy . . ., say 
+ + * • • + ^n^n» if probable error of each of 

the quantities u^, Uy . . u^ is q, then the probable error 
of € is {X^ + Aj* + +. . . + Thus in the 24-ordinate 

scheme, since 2AaQ-UQ-^u^ + u^ + . . .-^u^ the probable error 
of Oq is 2/^24 or 0*204$'. 

* £. T. Whittaker and 0. Martin, Monthly Notices^ RA.S. 71 (1911), p. 511« 





•• 0 |- 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 



Period used, 14*81 days. Observations falling exactly on the curve are denoted by strokes. 
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A sine term and a cosine term in the Fourier representation, 
\rhich have the same period, are often combined into a single 
term thus, 

<icos0 + 6sin0 = R8in (0 + o), 


where R - + V) and cos a = 6/R, sin a = al'R: we then require 

to know the probable error of E and a, assuming that the 
probable errors of a and 6 have already been calculated. 
Suppose that the probable errors of a and b are each of 
amount e. Then, since 

_ aSa + &86 

we see that the ’probable error in R is e. Moreover, since 

^ bBa-aSb 


the probaUe error in a is 

e 

It is important to have clear ideas on this subject, since 
otherwise there is a risk of carrying the computations to more 
digits than is warranted by the degree of accuracy of the data. 
This remark applies particularly to the computation of a. 

140. Trigonometric Interpolation for Unequal Intervals 
of the Argument. —^Lastly, we shall consider the representation 
of a function u{x) by a trigonometric interpolation formula, 
when the values of the function are known only for a set of 
values a, b, e, . . ., m, n of the argument, which are not at 
equal intervals apart. 

The problem, which is analogous to Lagrange’s problem in 
ordinary interpolation (§ 17), may be solved in more than one 
way: it is readily seen, indeed, that any of the following 
expressions will serve: 


1 «. 

u{x) 


_ sin|(a;-ft)sin|(a;-c) . . . sin^(a!-n) , . 
"sin J(o-ft)sin|(a.-c) . . , 8inJ(a-n) ' ' 
sin 1 (a;-a) sin4(a;-ft) ... sin4(a:-m) , 


* Cauchy, Comptes rendui, 12(1841), p. ^SZ = (Euvrea (1), 6, p. 71. 
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2®. 


3® 


(cosic-cos6)(cosa?-cosc) . 
(cosa-cos6)(cosa-cosc) . 
^ (cos X - cos a) (cos x - cos b) . 
(cos n - cos a) (cos n - cos b) 


(cosic-cosw) , . 

-)- \u\a) +. 

, (cosa-cosw) ' ' 

. (cosa;-cosm) 

. (cos n - cos m)' 


. , sinaj(cosi?:-cos J)(cosa;-cosc) ... (cosaj-cos^) , . . 
ulx) = - ) - {u(a) + . 

' • oin n lnf\a n _/t/\a /il Inna n _ nr\a Inna n _nna /n.\ ' 


sin a (cos a - cos b) (cos a - cos c). . . (cos a - cos n) 

U(«)- 


sin£c(cosa?-cosa) . . . (cosaj-cosm) ^ 


4®. 


u[x) ■ 


sinw(cos71-cosa) . 

sin {x - J) sin {x-c) . 
’ sin (a - 6) sin (a - c) . 
sin (.t* - a) sin (x - b) 
sin (n - a) sin (n - b) 


(cos n - cos 
sin (x - n) 


sin (a - n) 

. sin (x - m) 
. sin [n - m] 


u[a) +. . 


Ex. 1.—Discuss the relation of the above formulae to the formulae 
obtained for equal intervals of the argument in §§ 133, 134. 

Ex. 2 .—By making h tend to equality with a in formula 1® above, 
obtain an interpolation formula for the case when the function and its 
first derivative are both known at a; = a, while the function alone is 
known at a; = c, d, . . . . 

(This is useful when we happen to know the values of the argument 
for which the function is a maximum or a minimum.) 

Ex, 3.—Apply the first of the above formulae to obtain an expression 
for 

/.2ir 

u(x)dx. 


f 


(Use the formula 


J r2ir 1 1 1 N J V ^ 

I sin-(a;-a;2)sin-(a;-a;2) ••• ^ V> 

0 

where 8 denotes ajj + • • • 'h Sp is the sum of p of these x s.) 

[Baillaud, Toulouse Ann, ii. (1886), B.] 


Miscellaneous Examples on Chapter X 

In each of the following examples it is required to find a Fourier 
series for v{x). The tabulated values of u{x) represent equidistant 
ordinates spaced at intervals of of the complete period. 

* These two formulae are duo to Gauss, Nachlass, Werke (1866), iii. pp. 
291, 292. 

t Hermite, Cours analyse. 
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Ex, 1.— 


% 

«i 

«*!«s 

% 

«e «6 


«8 

u. 

«I0 

^11 

«12 

^13 

«14 

4 

4 

38|68 

79 

92 ( 

«17 

111 

! “ 

I'll 

18 1 

> 

94 

118 

115 

73 

^82 

86 

86 

78 


77 

86 

74 

43 

60 

35 

35 

20 

-10 



Ex. 2.— 



«1 

«2 

“a 

“4 

«5 

“6 

Uj 

“8 

«9 

«10 

j “ii 

’Ha 

1 

11 

50 

105 

138 

158 

159 

150 

139 

142 

143 

1127 

134 


«14 


“m 

Wl7 

^18 



^21 

^22 

«28 

135 

150 

160 

134 

120 

120 

100 

72 

66 

30 


Ex. 3.— 


“ol 


"2 

“8 

«4 

“6 

«8 

“7 

«8 

«9 

“lO 

“u 

^12 

«18 

25 

4 

u 

| 3 £ 

14 1 

i ! 

60 
«16 1 

45 1 

^10 

1 

50 

4i 

L 7 1 

3 1 

68 
“18 1 

118 

«19 

148 

^20 

160 

122 

’^22 

125 

^23 

139 


125 

85 

82 

60 

24 

20 

28 

45 

56 

40 



Ex. 4.— 


«0 

«i 

«a 

«8 

«4 

“5 

«6 

«7 

«8 

“9 

«10 

«u 

«12 

170 

142 

160 

171 

129 

139 

130 

150 

181 

179 

170 

148 

106 


«18 «U 


«16 

«17 

^18 

^19 

« S0 

«21 

“iB 

^23 

76 1 74 

7 

16 

34 

48 

69 

106 

130 

146 

160 


ADDITIONAL REFERENCE 

G. G. Danielson and C. Lanozos, ** Some improvements in practical 
Fourier analysis ”, J. Franklin Inst. 233 (1942), pp. 365, 435. 


[Copies of (he Computation Sheets facing pages S70 and 
S78 may be obtained direct from the Publishers in quantities 
of not less than one dozen^ at (he price of 2s. 6d. per dozen 
sets. Special terms will be quoted for orders of one thousand 
and upwards.'\ 










CHAPTER XI 

GKADUATION, OB THE SMOOTHING OF DATA 

141. The Problem of Graduation. —iSuppose that as a 
result of observation or experience of some kind we have 
obtained a set of values of a variable u corresponding to equi¬ 
distant values of its .irgument: let these values be denoted by 
^11 ' 2 ^ 2 * been derived from observa¬ 

tions of some natural plienomenon, they will be affected by 
errors of observation; if they are statistical data derived from 
the examination of a comparatively small held, they will be 
affected by irregularities arising from the accidental peculiar¬ 
ities of the Held; that is to say, if we examine another field 
and derive a set of values of u from it, the set of values of u 
derived from the two fields will not in general agree with each 
other. In any case, if we form a table of the differences 

. . ., A^i = AWg■" 

generally be found that these differences are irregular, so that 
the difference table cannot be used for the purposes to which 
a difference table is usually put, viz. finding interpolated values 
of u, or differential coefficients of u with respect to its argument, 
or definite integrals involving u. Before we can use the differ¬ 
ence table, we must perform a process of " smoothing "; that is 
to say, we must find another sequence Wg', . . Un whose 
terms differ as little as possible from the terms of the sequence 
^ 1 , . . ., Un, but which has regular differences. This smooth¬ 
ing process, leading to the formation of is 

called the graduation or adjustment of the observations. 

For example, let us consider an extract from the Government Female 
Annuitants (1883) Ultimate Table and form a difference,table of the 

285 
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entries. We have, denoting by probability of a person aged as 

dying in a year, 


as (Age). 


A. 

50 

1019 

631 

51 

1660 

61 

52 

1611 

142 

53 

1753 

19 

54 

1772 

-224 

55 

1548 

474 

66 

2022 

-99 

67 

1923 

-81 

68 

1842 

487 

69 

2329 



The differences are altogether irregular, and, as we shall see(§ 155), 
when the data are adjusted the differences become more regular. 

In dealing with experimental results of no great accuracy 
the smoothing process is generally performed graphically,* but 
in the present chapter we shall be concerned with more refined 
methods involving analytical formulae. 

142. Woolhouse’s Formula of Graduation.— We shall 
consider first a formula which, though now disused in practice, 
is of considerable historical and theoretical interest. Wool- 
house proposed f to pass five ordinary parabolas through the 
five sets of points 

* For the best graphical method cf. T. B. Sprague, J , T , A . 26 (1886), p. 77. 
For valuable comments on such methods cf. Whewcll, Novum Organum Re- 
ntmtum. Book III. oh. vii. p. 204 of the edition of 1858. 

t J-IA. 15 (1870), p. 389. InJ.LA, 28 (1882), p. 351, G. F. Hardy showed 
that the calculations required for the application of Woolhouse’s formula might 
be performed by a “columnar” process of calculation, in the form 
W = 2(1 ““ 86*) A5*S*u, 

which is identical with the form (tHIPKI - 36*)m, given later by him, since 
^s2=?[5]; and T. G. Ackland, J.L A * 28 (1882), p. 352, showed that they can 
be performed by summatioxiB of a different nature. 
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and take as the graduated value of u the arithmetio mean of 
the values derived from these five parabolas. Now using Aj 
to denote the operation A performed with the interval 5, hj the 
ordinary interpolation formula (§ 21 ), the values derived from 
these five parabolas are: 


2 3 

«0 = «-2 + gV'-2-25^*’'-7 

(from u.^, M.,, Ttg), 

«0 = “-l + gV-l-2g^5^-6 

(from M.j, M 4 ), 

^'0 ~ '^0 

(from M.j, Mg, Mj), 

«o“«i-gV'i + 25A^«-« 

(from M. 4 , «i, M,), 

2 7 

«0=«2-gV2 + 25V“-s 

(from Mg, M,), 


Therefore if denote the graduated value of u^, we have 

2 112 3 

- IgV'^-e + ( 1 ) 

where [5]ifo stands for + ''V 

The operation of replacing a term Ur by the sum 

n-l + ' * • + + . n-i 

will be called summation hy n\ and denoted by the operator [n], so that 

[W>Q=W_^^+. . . + ?io + . . . + ^11-1. 

2 2 

We shall use [?i]* to denote the effect of performing this operation 
twice in succession, so that, for example, 

= 1*4 + 21*3 + + 3^-2 + ^^-3 + ^- 4 - 

Since Ajtt.g = [5]A«o, and ^ 5*^.4 = [5]®AX, equation (1) maj 
be written 

j-gj V = “0 + - ^[5]A®W-8 - 

+ 2|[6]AX + ^[5]AS 

“ - b[ 6 K+g[5K - 

+ ^[ 6 ]A*«,. 
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so pp V “ “ “ 3 A*w_3 - 2A*m., + 3A*M3 + 7A*Mj 

= - 3m_3 + 4w_2 + M.J + V.3 + + 4^2 - 81/3 

- - 3[5]m_i + 7[5]«o - 3[5]tti. 

Thus Woolhouse’s formula of graduation may be written in 
the forms 

[51* 

< = - 3%-! + 7% - 3tta+i), 

or * «*' = - 38*Ma), or < = “ 3[3 ]}m*, 

or tta' = j|g{25M* + 24(%_i + Ma+i) + 21 (m *.2 + u^+s) 

+ 7(«»-8 + «*+») + 3(Ma,-4 + Ma+ 4 ) — 2{Ug.^ + Ug+t) 

- 3{ug.i + «*+?)}, 

or Ux « 0 * 200 m-/b + 0 » 192('Mjp«i + 0 * 168 ('M-{p -.2 '*^*+ 2 ) 

+ 0*066(‘M’/b_3 + + 0*024(‘14jb_4 + 

- 0>016(ux^e + %+ 6 ) - 0-024(^^»-.7 + ^^*+ 7 ). 

143. Summation Formulae, f—The formulae of Woolhouse 
may be regarded as a particular instance of a class of gradua¬ 
tion formulae, much used by actuaries, which may be called 
sumTjmtion formvlae^ and which are based on the following 
principle. 

Let A denote the operation of differencing, so that 
and, as in §142, let [2m+ 1]^* denote the 
sum of (2m +1) of which is the middle one. Then it is 
po^ible to find combinations of these operations A and [ ] 
which, when differences above a certain order are neglected, 
merely reproduce the functions operated on; so that we have 
(say) 

/[A, [ ]}Ma =» «« + high differences. 

We now take /{A, [ ]}«* to be the graduated value of «*, 
that is. 

[ ]K, 

the merit of this uj depending on the circumstance that 

* Tins form is doe to Hardy. J.l.A, 82 (1896), p. 372. 
t On Bnmniation Formulae cf. G* J. Lidstone, JJ.A» 41 (1907), p. 848, 
49 (1908), p.«106. 
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/[A, [ ]}Wa; involves a large number of the observed u\ whose 
eiTors to a considerable extent neutralise each other and so 
produce a smoothed value uj in place of Vx- 

In practice, instead of the symbol A, it is generally con¬ 
venient to use the symbol of central differencing 3, where 
denotes Uj^ - 2 wq + Writing E = we have 

8 * = E - 2 + E“^ = - 4 sin^ </», 

so that 


U^rn, + + • • • + % + * . . + 

^ + g-2t> ^ J _j. g2i^ ^ ^ 

= (1 + 2cos 2<^ +2cos4<^ + . . . + 2 cos 2m<f>)uQ 
_sin {2m+ !)<!> 

"" sin <l> 
and therefore 


r T sin nd} 

[n]uQ^—. — r-Wn 


8 in<^ 




71 (^ 2 - 12 ) 


3! 


SlU 


■9+ "5] 


2 _ ' 


sin^ <l> -. 


•)«o 


or 


■w,.. 


«0 + 


, o i/o + 2*. 5 ! ^ • 


This shows that + f^rms in 

8 ® 7 /q, . . . and therefore a sumviation formula, correct to third 
differences, is 

^ p.q.r { 24 j® 

Taking any two formulae of this type, and eliminating 8*, we obtain 
summation fonuulae of a type first introduced (but otherwise demon¬ 
strated) by J. A. Higham, JJ,A, 23 (1882), p. 335; 24 (1883), p. 44; 
25 (1884-85), pp. 15, 245. 

Formulae correct to fourth differences may be deduced by the above 
method. The use of a formula correct to too low an order may lead to 
systematic distortion of the results. 

It may be remarked that formulae such as Woolhouse’s, which are 
based on interpolations, may all be reduced to the summation type; but 
the converse is not true, so the summation method is the m^re general. 
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144. Spencer’s Formula. — Perhaps the best of the 
summation formulae of graduation correct to third differences 
is the 21-term formula of Spencer* namely, 




This may evidently be obtained by taking p = 5, 2 = 5, 
r = 7 in the preceding formula. We shall now obtain its 
expanded expression. If we perform summation by 7’s on 
2(1 -48*-38*- J8®) or (-E® + E + 2 + E"*-E"®), we obtain 

- E® - E« + 2E» + 3E* + 3E + 2 + 3E-* + 3E-* + 2E-® - E-« - E'®, 
and if on this we perform summation by 5 ’b twice, we obtain 

- E“ - 3E® - 5E« - 5E^ - 2E® + 6E® + 18E* + 33E® + 47E* + 57E 

+ 60 + 57E-* + 47E-* + 33E-® + 18E-* + 6E-® - 2E-® - 6E-’ 
-6E-*-3E-»-E-*« 

Spencer’s formula may therefore be written 
< = Tb{60Mo + 57(w-i + «i) + 47 (m- 2 + Wg) + 33 (m. 8 + itj) 

+ 18(«.4 + «4) + 6(m_ 5 + Wg) - 2(M.g + Mg) - + iiy) 

— 5(m_8 + Mg) — 3(m_j + Mg) — (m_jq + Mjq)} 
or 

«g' = 0'171 mo + 0"163(Mi + M_j) + 0'134(?/g + w.g) + 0'094 (m 3 + w.g) 
+ 0-051(m 4 + M.g) + 0-017 (m 5 + M-g) - 0-006(itg + m.,) 

-0"014(Mg + M_g) - 0-014(Mg + M.g) - 0*009(7/,+ M.g) 

-0*003(ttig + M.J. 

In the practical application of the formula we form the 
expression 

J( - Mg + Mj + 2 «o + M-i - M-a), 

sum by 7*8 and divide by 7, then sum twice by 5’s, dividing by 
6 each time. 

The following is an example of the working process of 
Spencer’s 21-term formula: f 

* This was employed in the graduation of the rates of mortality exhibited 
by the Manchester Unity Experience, 1893-97. Cf. J.LA, 88 (1904), p. 334; 41 
(1907), p. 361. 

t J. Spen96r, J.l.A, 88 (1904), p. 339. 
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0 ) 

(3) 

(3) 


0) 

(«) 

(0 

(«) 

( 0 ) 

( 10 ) 

Age 

(X). 

Ungraduated 

Qx. 

Divide 
by 7 

—lb. 

Sum 
in S’s. 


It.a 

+W 3 . 

(0-(5) 

Sum 
Col. G 
in rs. 

Divide 
by 5. 

Sum 
Col. 8 

in 5*8. 

Graduated 
7*ssBura in 
5’s cutting 
down aa 
for as 
necessary. 

20 

0-00431 

62 









1 

0-00409 

58 

181 

239 







2 

0-00429 

61 

179 

240 







3 

0-00422 

60 

197 

257 

128 

129 





4 

0-00530 

76 

208 

284 

129 

155 





25 

0-00505 

72 

214 

286 

136 

150 





6 

0-00459 

66 


275 

140 

1.35 

997 

199 



7 

0-00499 

71 

212 

283 

160 

123 

1034 

207 



8 

0-00526 

75 

226 

301 

157 

144 

1043 

209 

1055 


9 

0-00663 

80 

239 

319 


ini 


214 

1101 


30 

0-00587 

84 

249 

333 

167 

166 

1130 

226 

1154 


1 

0-00595 

85 

261 

346 

182 

164 

1223 

245 


•00614 

2 

0-00647 

92 

273 

365 

189 

176 

1302 

260 

1294 

-00648 

3 

0-00669 

96 

295 

391 

195 

196 

1375 

275 



4 

0-00746 

107 

312 

419 

BTiKl 

216 

1439 

288 

1438 


35 

0-00760 

109 

327 

436 

213 

223 

1508 


1501 

Hi 

6 

0-00778 

111 

338 

449 

215 

234 

1566 

313 

1559 


7 

0-00828 

118 

350 

468 

238 

230 

1616 

323 

IftWJ 


8 

0-00846 

121 

358 

479 

246 

233 

1667 

333 



9 

0-00836 

119 

371 

490 

266 

234 

1746 

349 



40 

0-00916 

131 

387 

518 

272 

246 





1 

0-00956 

137 

413 

550 

283 

267 





2 

0-01014 

145 

436 

581 

280 

Bitil 





3 

0.01076 

154 

461 

615 







4 

0-01134 

162 

477 

639 







45 

0-01124 

161 










145. Graduation Formulae obtained by Sitting a Poly¬ 
nomial. —We shall next consider a class of graduation formulae 
whichare based on wholly difiFerent principles from the summation 
formulae. Supposing the ungraduated values Ux to be plotted 
as points against the corresponding values of x, we shall fit 
a parabolic curve of some assigned degree j to the points 
{u^ny '*^-n+n . . Wq, . . ., v„), determining the constants of the 
curve by the Method of Least Squares, and we shall then take 
the ordinate of this curve at a? = 0 as the graduated value of 

*Cf. Sheppard, Proe. F. ItU. Cong, (1912), (ii) 348; Proc. L.M.8,'^ 18 
07; J.I.A. 48 181, 300, 40 148; Sherriff, Proe. B.S.E. (1920) 112; Condon, 
Calif. PuU. 8 (1027) 66; Birge and Shea, ibid., 67. • 
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Let us then find the polymmial of degree 

U{x) =Cq + CjX + + . . . + CjXi, 

n f ]2 ^ 

for which 2 -1 - u{p) | is a minimum ; then i^(0) vnll he taken 

to be. the graduated value of u^. 

The equations of condition to determine Cy, . . Cj are 

feQ + Cyn + c^n^ +. . , + Cj^ =Un 

Jco + Ci(ii-1)+C2(?i-1)2 + . • . + = 


[c^-Cjn + c^n^-. . . + Cj(-w)i 

If we now form the normal equations, it is evident that the 
coefficient of in every alternate equation vanishes, the sums 
of the odd powers of the natural numbers from - m to + m being 
zero. Let j == 2k or 2k + 1. Denote by 2 a summation over the 
values from ^nto n inclusive; let 2 .sp be denoted by 2^* and 
let 2sPw,, which is the ^th moment, be denoted by Mp. Then 
the normal equations which involve are 

^0^0 **’ ^2^2 + . • . + C2J^2k ^ Mq, 

® 0^2 ^ 2^4 + • • • + ^ 2 fc 22*+2 Mg , 

• • * • • 

^0^2k + ^222jb+2 + • • • + — M2jfc 


Solving these equations for we have 

. . . Msu; 

• • • ^+2 

• • • 2)2Jfe+4 


Mo Mg 
^2 ^4 
24 2 o 




.^ 0 ): 


^2k ^ 2 ik +2 • • • ^4ik 


2 , 


24 


• 22|.+2 
• 22fc+4 


1 ^2k 22Jb+2 • • • ^41; I 

* From § 69, we note that the sum of the pih. powers of the natural 
numbers from - n to inclusive (wliere p is an even number] is given by 
Sp=2{l^»+2®+. . .+W3P} 

where 84 = 1 ^, • • • 
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This is the OradvMted Value of u ^—Since the moments Mj, 
Mj, ... are linear functions of the ungraduated data . . ., 
Un, we see that the graduated value u^^ is also a linear function 
of these data. 

The following table is useful in performing the computations. 


[Table 



TabiiB of the Sous of Powers of the Naiobal Numbebs 
2,=(-«»)»>+(-m+l)«'+. . 




10 . 

770 

50666 

3956810 

335462666 

29828683850 

2734857072266 

256075605906890 

24341412264019466 

o» 

570 

30666 

1956810 

135462666 

9828683850 

734857072266 

56075605906890 

4341412264019466 

o6 

408 

17544 

893928 

49369224 

2855115048 

169997999304 

10322020996968 

635371886315784 


280 

9352 

369640 

15814792 

707631400 

32559045832 

1525927974760 

72421932894472 


182 

4550 

134342 

4285190 

142680902 

4876471430 

169481829062 

5956071755270 

j 

no 

1958 

41030 

925958 

21748550 

522906758 

12753500870 

313851940358 


60 

708 

9780 

144708 

2217300 

34625508 

546469620 

8676159108 

CO 

28 

196 

1588 

13636 

120148 

1071076 

9598708 

86224516 


10 

34 

130 

514 

2050 

8194 

32770 

131074 


1 04CMCiC<l 

» 

1 O" ^ ^ ^ 
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146. Table of these Formulae.—By substituting par¬ 
ticular numbers for k and n and evaluating the determinants, 
we obtain the following table of the graduation formulae which 
are obtained by fitting a polynomial to the data. 

Case I, h = 0, i.e, fitting a straight line to the data. 

Case IL 4=1, i,e. fitting a 'parabola of degree 2 or 3. The 
general formula in this case is 

+ 3?i - 1 - 5s^ 

where - 3 • f2n-l)(2n + 1 ) ( 2 ^ 3 )' 

This gives 

W = 1 1(q = Vfy . 

W = 2 Wo*= + - 3(«2 + M-2)} = ^^0“ 

w = 3 Wo'= ji\{7i(o+6K + «-i) + 3(M2 + «- 2 ) -2(»3 + M.3)}. 

M = 4 ttg' = + “-i) + ^9(!«2 + K-a) + 14 (m3 + 

— 21(Mj +W-^)}. 

m = 5 Kq' = -4-jjr{89Mo 34(«i + «_i) + 69(«2 + tt-2) + 44 (m3 + m-,) 

+ 9(?<4 + «-4) - 36(«5 + 

m = 6 = iiir{25Mo + 24^ + 'u.^)-¥ 2l{v^ + «-2) + 16K + 

+ 9(?<4 + «_ 4 ) - ll(!/g + «.e)}. 

m = 7 ’'i)' = viVB{167?fo+ 162(«4 + m_i) + 147(m2 + '**-8) 

+ 122(!/3 + M_3) + 87(^4 + « -4) + 42(H5 + U _g) 

— 13(wj + w_ j) — 78('!/y + 

7i - 8 Wo' = {43 mo + 42 (itj + m_ j) + 39(Wg + w-g) + 34(«3 + W-j) 

+ 27(W4 + W_ 4 ) + 18(W5 + 7 ^. 5 ) + 7(lfg + 7t_j) 

- 6(tij + W.7) - 21(7«g + ii.g)}. 

n = 9 ifo' = ^a‘(rT{ 269 K.o + 264 ( 7 «i + 7 (_i) + 249 (M 2 + 7 t. 2 ) 

+ 224(113 + 11 . 3 ) + 189(M4 + 74 . 4 ) + 144(775 + 7*_j) 
+ 89(77g + 7i_g) + 24(7<2 + 7<-7) - 61(7fg + 77.g) 
-136(M, + 7i_g)}. 

n —10 77o “ ■5TyVv{3297/Q + 324(77^ + + 309(772 + 77 _g) 

+ 284(773 + 77.3) + 249(774 + 77.4) + 204(775 + 77.5) 
+ 149(77, + 77_,) + 84(77, + 77.,) + 9(77g + 77_g) 

- 76(77, + 77_g) - niKo + 7t-io)}. 
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Case III. k = 2, i.e, fitting a parabola of degree 4 or 6 . 

» = 2 <=>Mo. 

» = 3 =-+ M-i) - 30(«j + M-j) 

+ 5(M8 + tt_3)}. 

» =• 4 Uq = T2'Tr{l'79M(, + 135 (mj + li.j) + 30(«2 + ^- 2 ) 

- 55(«3 + M-a) + 15 (m4 + M- 4 )}. 

» = 5 1 * 0 ' = irsT{143M0 + 120(«i + •m_i) + 60(«2 + m-^) 

- 10 (m 3 + tt-a) - 45 (m4 + + 18(m5 + m.j)}. 

71 =» 6 Ug = + 600(% + it.j) + 390(«2 + 

+ 110(7<3 + M.g) - 135(7*4 + 7*_4) - 198(7*6 + 7{_j) 

+ 110(7*4+ 7*_6)}. 

n = 7 7*o' = TB-l-ffT{llOC 37*0 + 10125(7*4 + 7 *_ 4 ) + 7500(7*2 + 7*-j) 

+ 3755(7*3+ 7*_3) - 165(7*4+ 7*_4) 

- 2937(7*5 + 7*_5) - 2860(7*4 + 7*_4) 

+ 2145(7*2+ 7*_2)}. 

71 = 8 7*0* = i“^ff{8837*0 + 825(7*4 + 7*_4) + 660(7*'2 + 7*_2) 

+ 415(7*3 + 7*_3) + 136(7*4 + 7*_4) - 117(7*6 + 

- 260(7*4+ 7*.4) - 196(7*2+ 7*_2) + 195(7*4+7*_g)}. 

n = 9 7*0' = ^7^1139326^ + 1320(7*4 + 7*_4) + 1110(7*2 + 7*_2) 

+ 790(7*3 + 7*_3) + 405(7*4 + 7*_4) + 18(7*6 + 7*.j) 

- 290(7*4+ 7*_4) - 420(7*2 + 7*_2) - 255(7*4+ 7*_g) 

+ 340(7*4+ 7*_,)}. 

7t = 10 7*0' =• aao'oi Tl440 037*0 + 42120(7*4 + 7*._4) + 36660(7*2 + 7*_2) 
+ 28190(7*3+ 7*_3) + 17055(7*4+**.4) 

+ 6378(7*6 + 7*_6) - 3940 ( 7*4 + 7*_4) 

- 11220 ( 7*2 + 7t_2) - 13005(7*4 + 7*_4) 

- 6460(7*0 + T*.,) + 11628(7*40 + 7*_40)}. 

147. Selection of the Appropriate Formula.—^Among 
the many formulae of the last section, we have to determine the 
one which is most appropriate to the particular material that is 
to be graduated; this may he done in the following way: 

Prom the formulae of § 145 we see that if we tried to fit an 
ordinary parabola y“C 0 + <! 4 a; + c^* to data 7*.,j, 7*_„+j, . . ., 7*„, 
then we should have 

®o^o + °° ^o» 

^0^2 ^2^4 “ 

. »?oM,-3M0 

'* vr-s,* • 


and therefore 
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Now Cg is JAY where AY denotes the second diflference of y, 
which is of course constant since y is of the second degree in x. 
So if we try to represent a certain stretch of the data, say from 
to inclusive, by an ordinary parabola, then the most 
probable value of AY for this parabola is 

2(SoM,~vMo) 

20^4-^ 2 " ■ 

This is readily calculated (the 2*8 being given at once by the 
table of § 145), and thus we can make a preliminary test of the 
value of AY for curves roughly fitting the data in different 
" stretches.” This enables us to judge what is the lowest order 
of parabola which will give a satisfactory fit when we use some 
definite number {2% + 1) of data in the graduation formula. 

Ex, 1 .—If a set of observations i/g* • • •» Vw corresponding to equi^ 
distant values of the argument Xj is given, and if we represent these as well 
as possible by a formula of the type y = Aa + B, so that Ay is constant for 
the graduated values^ then the most probable value of this constant Ay is 

_ 6 _ 

n(n* - 

{l.(n-l)Ai /3 + 2.(n-2)Ai/2+3.(n~3)A^3 + . . . + (n- 1).l.Ayn-i}. 

Ex, 2. —If a set of observations y^, y.^ , , y,i, corresponding to equi-‘ 
distant values of the argument x, is given, and if we represent these as well 
as possible by a formula of the type y = Ax^ + Ba; + C, so that Ahj is constant 
for the graduated values, then the most probable value of this constant AY w 

30 
(,?- 

Ex, 3. —If a set of observations y^ , , ., y^ corresponding to equi¬ 
distant values of the argument x, is given, and if we represent these as well 
as possible by a formula of the type y = Aac® + Ex^ + Ca; + D, so that AY w 
constant for the graduated, values, then the most probable value of this 
constant is 

140 

n(n^ .22) (w2 - 32 ) 

^{p(p-¥ l)(p + 2)(ti-p)(n-p- l)(?i-p-2)AY2,}. 

V 

148. Tests performed on Actual Data.* —We shall now consider 
the relative merits of Summation formulae and Least-square formulae 
as tested by their performance when applied to definite numerical data. 

* Sherritf, loc, dt. 
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We have first to decide what is to be accepted as the measure of good 
performance in a graduation. The test we shall use may be described 
thus: 

Consider some known analytic function of Xy such as log x, of which 
tables accurate to, say, 6 places are available. A 4-place table of this 
function may be prepared by omitting the last two digits (which will be 
called the tail) and forcing,* i.e. increasing the last retained digit by 
unity when the omitted tail begins with one of the digits 6, 6, 7, 8, or 9. 
We can regard the values of log x given by the 4-place table as affected 
with “ errors,” namely, the errors which have been produced by omitting 
the tails. Let us now take a sequence of these 4-place values, and 
graduate them by the graduation formula which is to be tested; the 
effect of the graduation should be to smooth out the “errors” and restore, 
to some extent at least, the more accurate values of the 6-place table. 
The success with which this is performed may be taken as a measure of 
the merit of the graduation formula; for it must l^e remembered that 
the purpose of a graduation formula is precisely to reduce the magnitude 
of accidental errors. The advantage of using a known function, such as 
log X, for the test is that we can be certain that the errors (viz. the tails) 
are accidental, ue, non-systematic. There is, however, the disadvantage 
that the errors do not obey the normal law of frequency, since within 
the limits ±0*5 of the last place, the probability of an error € does not 
vary with e 

In the following table this method of testing is applied to the function 
10 ’ 

— - 39,999'95. Spencer’s formula and the Least-square formula 1, 

m= 10 are used. The merits of the graduated values are obtained by 
comparing columns 9, 10, and 11: the result is that the sum of the 
squares of the residual errors is 873 when the Least-square formula is 
u^, and 1327 when Spencer’s formula is used. 


[Table 
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Graduation of thb Reciprocals of Nuebers by Spencer’s Formula 
AND THE Least-square Formula ^=1, ?i=10 
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149. Qraduation by Redaction of Probable Error.— The 

graduation formulae which we have obtained by fitting poly¬ 
nomials have been derived otherwise by W. F. Sheppard,* who 
approaches the problem in the following way: 

As before, let... '*^ 2 > • • • ungraduated 

values of u. Suppose that the determination of each of these 
is subject to the probable error e. If the graduated value of 
-Wo is 

V = W'o + Pl^h + + • • • 

+ . . .+JP-nW-n, 

then the probable error of is 

(p-n®+^?-n+l^ + - • ^+Pn-1?+Pn)^€. 

Sheppard lays down the condition that this quantity is to be 
a minimum, subject to a further condition which secures that 
the graduated values shall not ditfer systematically from the 
ungraduated. This latter condition he takes in the form that 
Uq is to differ from u^ only by differences of u^ of order (j +1) 
and upwards \ this amounts to supposing that the (/’+l)th 
differences of the u'^ are negligible. 

Now, by a discussion resembling Laplace’s and Gauss’s 
Theoria Gombinationis proof of the Method of Least Squares 
(§ 115), we see that Sheppard’s conditions are really equivalent 
to the two conditions which were laid down in § 145; and 
hence the graduation formulae obtained ly this method are 
identical with those which have been obtained (§§ 145-14*7) by 
fitting polynomials to the data by Least Squares. 

150. The Method of Interlaced Parabolas.— A method 
of graduation proposed in 1922,f which yields satisfactory 
results when applied to actuarial data, may be explained as 
follows. 

* Proe. of ihe Fifth IrU. Congress of Mathematicians (Cambridge, 1912), ii. 
p. 348, and other papers quoted in the footnote, p. 291. A. C. Aitken, Proc. 
Soy. Soc. Edin. 58 (1932), p. 54, has solved the problem of polynomial 
graduation by use of orthogonal polynomials, obtaining a process which is 
decidedly preferable to those of Sheppard and Miss Sherriff. 

- 53 (1922), p. 92. 
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Let a polynomial of the third degree 

w = a + 6a; + + doi? (1) 

be determined by the conditions that (1) it is to take as nearly 
as possible the values • • •> '*^ 2 > • • •> when 

X has the values - m, - m + 1,. . - 2, 0, 2,. .respectively, 
and (2) it is to take precisely the values u' and when X 
has the values -1 and 1 respectively. Here as usual 
. . . denote ungraduated values and . . . denote gradu¬ 

ated values, so that we are really finding a parabola of the third 
degree which will fit as well as possible the ungraduated data 
• • •> rigorously the two gradu¬ 

ated values and V, which for the moment are supposed 
already known. Then the ordinate of this parabola at x=^0 icill 
be taken to be the graduated value u^. The graduated values 
will therefore be given by a series of interlaced parabolas, each 
of which passes through three consecutive graduated values, so 
that each successive i)air of these parabolas has two points on 
the graduated curve in common. The equations of condition 
are evidently 

+ 6r + cr* + (2) 

for r = - ui, - //I +1, . .- 2, 0, 2,. . m ; and we may without 
error include 7’ = - 1 and r = 1 in this sequence, since the effect 
of the two equations of condition thus introduced will be 
nullified by the two equations which have to be satisfied 
rigorously, namely, 

%(!.^ = a- b + c - d 
u(r=a-\-b-\-c + d 

r — m. 

Denoting as usual 2 by 2^, we have therefore to choose 

rss —7/1 

a, 6, c, d so as to make 

m 

2 (a. + 6r + cr® + (f/’* - w,)® 

r= -7)1 

or 

a% + + c »24 + + 2ac2, + 2hd\ - - 2J25'«, 

- 2c27^zfy - 2d2,7^^ 
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a minimum subject to the rigorous equations (3). The normal 
equations are therefore (§ 108) 

4* cSg — + X + fi =0 

tSg + - X + /A = 0 .. V 

+ X + /A = 0 
+ 62^ - — X + /A = 0, 

The unknowns a, h, c, d, X, /a are to be determined from 
equations (3) and (4). Evidently we need only consider the 
equations which involve a, c, and (X + /a), which are 


+ '/</= 2a + 2c, 
oEq + c22 — 2?/y + (X + /a) = 0, 
c 24 + a22 - 2r%y + (X + /a) = 0, 


and we have also u„' = a. 

Eliminating a, c, and (A + /*) from these four equations, we 
have 


_ 2(2,-2^> 2o) 22(r^-lK 

y „ oy j, y 

Thus if we write cos0 for —* .is —and for 

^4-■2-2 


— j.jjg graduated values of satisfy the linear 


difference equation 


« *+i - 2 cos 5 
the solution of which is 
ii'=A cos a;0 + B sin a;0 

w 

KoSina:fl + KiSin(a!-1)0 + . . . + Kj5_i8in0 
+ , 

where A and B are the constants of integration; they may be 
used, in the case of mortality data, to make the deviations of 
the actual from the expected deaths, and the accumulated 
deviations, zero; or in all cases they may be used to reproduce 
the moments of order zero and one, of the ungraduated u’s. 
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151. A Method of Graduation based on Probability.*— 

The methods described above fulfil their purpose of smoothing 
out irregularities from observational data in a way which is on 
the whole efficient. From the purely theoretical standpoint they 
are not altogether satisfactory, since each of them contains 
arbitrary or empirical elements whose introduction does not 
appear to be logically necessary; in the methods of §§ 142, 
160 it is not obvious (apart from mere convenience) why we 
should fit parabolic curves to the observations rather than fit 
curves such as (say) 

y • 

In order to find a sounder basis for the theory, we must 
remember that the problem of graduation belongs essentially 
to the mathematical theory of probability; f we have the given, 
observations, and they would constitute the "most probable** 
values of u for the corresponding values of the argument, were 
it not that we have a prion grounds for believing that the 
true values of u form a smooth sequence, the irregularities 
being due to accidental causes which it is desirable to eliminate. 
The problem is to combine all the materials of judgment—the 
observed values and the a pnon considerations—in order to 
obtain the " most probable ** values of u. 

Let us then suppose that we are concerned with a number 

which depends on an argument a;, and suppose that we have 
n data which are affected with uncertainties or irregularities 
due, e,g.^ to accidental errors of observation; so that when is 
plotted as a function of x, the n points so obtained do not lie 
on a smooth curve, although there is a strong antecedent prob¬ 
ability that if the observations had been more accurate the 
curve would have been smooth. We may make the somewhat 

* Whittaker, iVoc. Edin, MaJth, Soc, 41 , p. 63 (read Nov. 14, 1919: printed, 
with additions, in the volume for 1922-23). The method has been further 
improved by Whittaker, Prot, Jl,S. Edin, (1924). 

t Tiie first recognition of this fundamental principle seems to have been 
made by Mr. Q. King in the course of the discussion on Dr. T. B. Sprague's 
paper of 1886, J,LA. 26 , p. 77: '* What is the real object of graduation ? Many 
would reply, to get a smooth curve; but that is not quite correct. The reply 
should be, to get the most probable deaths.*' 
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vague word “ smooth ” more precise bj interpreting it to mean, 
that the third differences are to be very small. 

Now consider the following hypothesis: that the true value, 
which should have been obtained by the observation lies 
between and < + <r, where cr is a small constant number 
[e.g. one unit in the last decimal place used in the measures); 
that the true value which should have been obtained by the 
observation for lies between and + cr, and so on; and 
finally the true value which should have been obtained by the 
observation for lies between and + cr. This hypothesis 
we shall call ** hypothesia H** Before the observations have 
been made we have nothing to guide us as to the probability 
of this hypothesis H except the degree of smoothness of the 
sequence which may be measured by the smallness 

of the sum of the squares of the third differences. 

S = « - 3< + 3< - + 37 / 3 ' - 7^)2 +. . . 

+ {< - 

S may be called the nteasure of roughness of the sequence. 

The theory may be extended to the case when the observations are 
not taken at equidistant values of the argument, by taking instead of 
S the sum of the squares of the third divided differences of the graduated 
values. 


We may therefore, by analogy with the normal law of 
frequency, suppose that the a •prioi'i probability of hypothesis 
H is 

ce-^* 8 o-», (A) 


where c and A denote constants. 

Next let us consider the a pinori probability that the 
measures obtained by the observations will be % . . ., on 
the assumption that hypothesis H is true. 

Since the true value of the first observed quantity is, on 
this hypothesis, the probability that a value between and 
«i + <r will actually be observed is (postulating the normal law 


of error) 


a 


where A, is a constant which measures the precision with which 
this observation can.be made. Similarly the probability that 
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a value between and 4- <r will actually be obtained for the 
second observed measure is 


75^ 


where Ag is the measure of precision of this observation. Thus 
on the assumption that hypothesis H is true, the a priori 
probability that the observed measure of the first observed 
quantity will lie between and + tr, the observed measure 
of the second observed quantity between and + and 
so on, is 






(B) 


where F denotes the sum 


F=VK-“iT+VK-<)*+• • •+VK-«»')*• 

The sums S and F enable us to express numerically the 
smoothness of the graduated values, and the fidelity of the 
graduated to the ungraduated values respectively. 

We must now make use of the fundamental theorem in the 
theory of Inductive l^robability, which is as follows. Suppose 
that a certain observed phenomenon may be accounted for by 
any one of a certain number of hypotheses, of which one, and 
not more than one, must be true: suppose, moreover, that the 
probability of the sth hypothesis, as based on information in 
our possession before the phenomenon is observed, is p^, while 
the probability of the observed phenomenon, on the assumption 
of the truth of the sth hypothesis, is P,. Then when the 
observation of the phenomenon is taken into consideration, the 
probability of the sth hypothesis is 

^Psv: 

where the symbol 2) denotes the summation over all the hypo¬ 
theses. It follows from this that whereas before the phenomenon 
was observed the most probable hypothesis was that for which 
Pt was greatest, the most probable hypothesis after the pheno¬ 
menon has been observed is that for which the product V^ps is 
greatest. Applying this theorem to the case under ponsidera- 
(d8ii) 
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tion and combining expressions (A) and (B), we see that the 
most probable hypothesis is that for which 

ehyh^ . ■ . Ata-p «n 

is a maxMmm ; that is to say, tlu most probable set 
of values of the quantities is that which makes 

A*S + F 

a minimum. 

152. The Analytical Formulation. —^Writing down the 
ordinary conditions for a minimum, we obtain the equations 

= h^tf - 

= h^u^' + - A®A®Mj', 

V^s = V'<a' - 3 A*A»m/ + 3A*A%j' - A^A^, 

V“4=V«4' + - 3A*A%j' + 3A*AX' - A*A*<, 


= KW + A*A%'„.3. 

We shall now make the simplifying assumption that the measure 
of precision is the same for all the data, so = . . . = A„. 

If this is not the case, we graduate some function of u, such as log u, 
instead of u, choosing this function so that its measure of precision has 
nearly the same value for all values of the argument. 

If we write hf = eA®, the equations may now be written 

eu^ = - A®Mj' ' 

etfj = eMj' + 3 A*Mi' - Ahi^ 

cMj = eu^' - 3A%/ + 3A»Mg' - AVg . 

€U^ = eu^ + A*Mj' - + 3A®i<3' - ti?u^ ' ' ' 

€M„ = eM„' + A%'„_3 ; 

Now all these equations, except the three first and the three 
last, are of the form 

= €Uf - AV* _s. 

Moreover, if we introduce a quantity uf such that = 0, the 
third equation becomes 

e«3 = €«3'-A*Mg', 

which is cf the same- form; and similarly the first two and last 
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three equations can be brought to the same form by introducing 
new quantities «'_j, «'_j, it'n+z, tt»+ 3 , such that 

AV.i = 0, A*m'_ 2 = 0, AV„.2 = 0, A*?<'„.i = 0, A®?/,.' = 0. 
Thus the graduated values ti j satisfg the linear difference equa¬ 
tion 

eUx “ A®/f j;_3 = cy/j., (2) 

ieing in fact the particidar solution of this equation which satisfies 
the sic termincd conditions 

A®yyo' = 0, AV_j = 0, AV_jj = 0, AV„_2 = 0, AV„_i=0, 

AV = 0, (3) 

whence we have at once 

A%/_2 = 0, AV.i = 0, AV.2 = 0, AV„.2 = 0, AV„.i = 0, 

AV„.2=0. (4) 

153. The Theorems of Conservation.—From (2) we have 
by summation 

€(«/ + #2' + . . . + ?y,;)-€(yyi + ?y2 + . . . + ?«„) 

= A«yt'.2 + AV_i + . . . + A«»'„.3 

= A®«'„.2-AV.2 
= 0, by (4). 

Therefore 

Ml' + 'Hj +. . . + u,'==v^ + v^ + . . . + 7/„. ;5) 

Moreover, by (2), 

t{uf + 2u^ + . . . + nyf,/)-€(yq + 2 yy 2 + . . . + m(0 

= A«?t'_2 + 2A*yf'_^ + . . . + «A«jy',._3 
= «AV„.2 - A«y?'„-2 + A®?t'.2 
= 0, by (4). 

Therefore 

Ui+2uf + . . . + nu,’=Vi + 2Hg + . . . + wy„. (6) 

Next, by (2), 

€« + 2*M2' +. . . + nV) - eO'i + 2*^2 +. . . + »X,) 

= A«M'.2 + 2®A«7t'_i + . . . + »®AV„.3 

= »*AV„_2 - (2w - 1)AV„.2 + 2A»y/„ -2 - AV_j 
= 0, by (3) and (4). 

Therefore 

< + 2 * 7 / 2 ' + - • . + wV = «i + 2*«* + - • • + ’*V 


( 7 ) 
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Equations (5), (6), (7) show that the moments of orders 
0 ,1, 2 are the same for the graduated data as for the original 
data. This may be called the Theorem of the Conservation 
of Moments. We may express it by saying that the graph 
which represents the ungraduated data and the graph which 
represents the graduated data have the same area, the same 
X co-ordinate of the centre of gravity, and the same moment 
of inertia about any line parallel to the axis of u. 

154. The Solution of the Difference Equation.^ —We 
have now to solve the central difference equation of gradua¬ 
tion, 

AVa;-3 = €Ua., 

subject to the six terminal conditions imposed in § 152. Let 
us assume for the present that € is given. Then we may 
obtain a general solution by means of symbolic operators as 
follows. 

Since A ^ E -1, the equation, in terms of E, becomes 

r(E-l)«-cEn 

I E^ 

[ E® “I 
(E — 1)® — cE^ 

Considerations of symmetry lead us to expect that each u' 
will be given as a linear function of u's, with coefficients 
symmetrically placed about that of the central term. This 
suggests that we expand the operator on the right of equation 
(8) in powers of both E and E“^, in fact as a Laurent series.^ 
Such an expansion is readily seen to be possible, for the equa¬ 
tion {z-Xf-€i!? = 0,hemg a reciprocal equation, has its six 
roots reciprocal in pairs, so that three are less in modulus than 
unity, and the remaining three greater; hence 
may be expanded as a Laurent series, convergent for 0 = 1, 
and the coefficients, with which we are chiefly concerned, may 
be found by the usual theory. That these coefficients will be 

The method of solution described in §§ 154-155 is due to Dr. A. O. Aitken 
and was first given in a thesis for the doctorate of the University of Edinburgh, 
submitted in May, 1925. 

tCf. Whittaker and Watson, Modem Andlyeie, g 5.6. 



GRADUATION, OR THE SMOOTHING OF DATA 309 


symmetrically disposed follows from the fact that the operator 
is left unaltered by the substitution of E“^ for E. 

Appeal to the theory of Functions of a Complex Variable may be 
avoided, if preferred, by having recourse to the theory of Partial Frac¬ 
tions. If we resolve the operator in question into six partial operators 
by this theory, then, for the reasons already stated, three of the partial 
operators may be expanded in powers of E, tliree in powers of E”*, 
whence by addition the complete expansion is obtained. 

Either of these ways leads to the same graduating formula, 
namely, 

+ • • •> (9) 

where ^ ( 10 ) 

(a-pXa-yXa-a ^)(a-^ ^Xa-y V 


in which a, )Q, y are the three roots less in modulus than 
unity* of the equation and 2 denotes inter¬ 

change of a, jS, y followed by summation. One of these roots 
is evidently real; hence we may write them as 
when ( 10 ) becomes 

- % —;- 

- 29\r2 cosO + - 2^-} cosO + ^j | j 

n+l 

where A = 

1 — 

B = sin(»- 2 ) 6 - +^^si^0^ - 1)6 

+^ sin(» +1 )0 - ^ sin(»+ 2 ) 0 j, 

C = (l-r 22 ) 8 infl^r 2 *- 2 cos 20 +-l) 


By Tnft«.na of ( 11 ) the coefficients have been calculated for 
a number of representative values of €, O-Ol, 0*02, 0«05, 0*1, 
0-25, and 1. A table of these to four decimal places is given 
on p. 314. 

A defect inherent in most methods of graduation based on 
linear compounding of tt’s on either side of a particular is 
that it is iTYipossible to gvcLductts the whole of d given set of 

* It is assumed throughout that e is positive, not zero. 
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data\ e.g. Spencer’s formula is inapplicable when we are 
within ten places of either terminal. (See the example of 
§ 144, p. 291.) This defect will also seriously impair the 
present method, unless by some means, without departing 
from the conditions of the problem, we can attach to each 
end of a given set the additional, or, as we may call them, 
auxiliary data, Un+2, •.aiid u_i,..., the existence of 

which is implied in formula (9). (The number of data to be 
attached in practice will depend on the degree of accuracy 
required and the rapidity with which the successive graduating 
coefficients diminish.) As in § 152 we introduced six new 
quantities ^ 6 n+l, tfcM+2, tt'n+a, so let us now intro¬ 
duce a further indefinite number Un+ 4 , and u'_s, 

u'-. 4 ,..., under the same condition, namely that third 
differences involving graduated data beyond the original 
tei^ini are to be all zero. Differencing the added third 
differences three times more, we obtain 

A^u'n ^2 = 0, AV«-.i = 0 ,..., and AV.a = 0 , AV -4 = 0,..., 

from which, referring again to the difference equation, we 
must have 

v/ Ji+I = Un+l, nfn+2 “ 'W/n +2 • • •> and v/q = Uq, v/ -1 = U-1. • •• ( 12 ) 

Thus the condition imposed on the auxiliary data is that 
graduated and ungraduated values are to coincide. It also 
follows that 

A®t4n+i = 0, A®u„+ 2 =0,..and A%- 8 =0, A®u .4 = 0,. • ^ (13) 

The difference equation ( 2 ), with the conditions (3) and 
(12), or (3) and (13), suffices to determine the external data. 
If the original number of data to be graduated is not too 
small (in general, if it exceeds twenty-five), the external data 
may be found as accurately as may be required by the follow¬ 
ing method. 

We have, from ( 12 ) and the difference equation ( 8 ), 
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This equation may be written 

(E-1)3 (E-1)3 

(E - a)(E - ^)(E - y)-(E - a-i)(E - ^-i)(E - yi)' 


(14) 


We shall show that 'M^n+ 2 > • • • are given by the equa¬ 
tion formed by retaining only the first operating factor on the 
left side of (14), viz. the equation 


(E-a)(E-^)(E-y) 




(15) 


In the first place we see that (15) gives u* uniquely in 
terms of for the operator can be expanded in 

descending powers of E, in the form 

1 -jiE“^ “ jsE"* -.. 


so that we obtain 

'^x +i2'*^a;-2 + • • •> 


(16) 


for 0 ? = ti + 1, 71 + 2, .... From this equation Un+i, t 6 n+ 2 i ••• 
are determined in succession, provided that the terms con¬ 
taining the undetermined external data 7^, U-i, ... are 
negligible. 

Next, the values of xix thus determined will satisfy (14)^ 
for the operator 

(E- 1 )^ 

(E-a-i)(E-^-i)(E-y'i) 


is expansible in ascending powers of E, and therefore (14) can 
be deduced from (15), which holds for a? = 7 i +1, + 2,.... 

Lastly, the conditions (13) will be satisfied, for from (15) 
we have 


(E-a)(E-/3)(E-y) 


.(E-a)(E-/8)(E-yK = 0. 


since (15) holds for an ascending series of values of x. 
Hence (E-= 0 , for x = n + l, n + 2,.... 
The value of jn iii (Id) easily found to be 

- 1 A» Bin(n-2)9-n Mn(n-!)»] •> 

ri»-2r,r,coB»+r,*^*l 8in« / 
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By means of (17) the coefficients jn have been calculated for 
the same values of € as before, 0-01, 0-02, 0*05, 0*1, 0*25, 
and 1. It is found that they decrease successively with 
sufficient rapidity for all practical purposes. 

Actually only three auxiliary data, Un+ 2 , ^n+si iieed 
be calculated, for the rest can be found, again in virtue of 
(13), by forming a difference table with zero third differences. 
Exactly the same process may be applied to the other end of 
the table by reversing the order of the data. 

We have finally to ascertain what values of c are likely 
to be of standard practical application, and also to find a 
means, given any particular set of data to be graduated, of 
assigning in advance an appropriate value from among these. 
Here we may refer to the notion of a “ smoothing coefficient ”, 
introduced by G. F. Hardy,* and based on the following 
considerations. The 'uCs have been supposed subject to the 
same probable error, p, say. Then the probable error of their 
third differences, e.g. of - 3^2 + 3tfci -is pV(l + 3^ + 3^ +1), 
or p\/20. But if we have a graduating formula 

then = 2( A%n)Un, 

so that the probable error of AVo is ^^/{^(A^Ajn)*}. The ratio 
N/{2(A%n)^}:^/20 may then be regarded as measuring the 
degree of reduction of probable error in third differences; it 
has been called the “smoothing coefficient” for the formula 
in question. 

*JJA. 82 (1896), p. 376. The name “smoothing coefficient” is due to 
J. Spencer. Of. G. J. Lidstone, JJ.A, 42 (1908), p. 114. 
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Table of Coefficients Jn 


e = 001 . 

e =0 02 . 

«= 006 . 

6 = 01 . 

€= 0 - 25 . 

e = l . 

0-9155 
0-4491 
0-1321 
- 0-0563 
- 0-1442 
- 0-1619 
- 0-1374 
- 0-0937 
-0 0474 
- 0-0084 
0-0185 
0-0330 
0-0369 
0-0334 
0-0257 
0-0167 
0-0083 
0-0017 
- 0-0027 
- 0-0049 
- 0-0055 
- 0-0049 
- 0-0037 
- 0-0023 
- 0-0010 
- O-OOOl 
0-0005 
0-0008 
0-0009 
0-0007 
0-0005 
0-0003 
0-0001 
0-0000 
- 0-0001 
- 0-0001 
- 0-0001 
- 0-0001 
- 0-0001 

1-0237 

0-4337 

0-0626 

- 0-1293 

- 0-1917 

- 0-1745 

- 0-1199 

- 0-0579 

- 0-0065 

0-0268 

0-0418 

0-0425 

0-0345 

0-0229 

0-0114 

0-0024 

- 0-0034 

- 0-0060 

- 0-0063 

- 0-0051 

- 0-0034 

- 0-0017 

- 0-0003 

1 0-0006 
0-0010 
0-0010 
0-0008 
0-0005 
0-0003 
0-0000 
- 0-0001 
- 0-0002 
- 0-0002 
- 0-0001 
- 0-0001 

1-1847 

0-3812 

- 0-0600 

- 0-2297 

- 0-2309 

- 0-1542 

- 0-0633 

0-0069 

0-0452 

0-0548 

0-0460 

0.0293 

0-0127 

0-0005 

- 0-0060 

- 0-0079 

- 0-0067 

- 0-0043 

- 0-0018 

0-0000 

0-0010 

0-0012 

0-0010 

0-0007 

0-0003 

0-0000 

- 0-0001 

- 0-0002 

- 0-0002 

- 0-0001 

1-3209 

0-3082 

- 0-1750 

- 0*2965 

- 0-2294 

- 0-1060 

- 0*0027 

0*0531 

0*0654 

0*0512 

0*0281 

0*0078 

- 0-0045 

- 0-0091 

- 0-0083 

- 0-0052 

- 0-0020 

0-0003 

0-0013 

0-0014 

0-0010 

0-0005 

0-0001 

- 0-0002 

- 0-0002 

- 0-0002 

- 0-0001 

1-5204 

0-1523 

- 0-3468 

- 0-3471 

- 0-1700 

- 0-0075 

0-0731 

0-0810 

0-0523 

0-0189 

- 0-0031 

- 0-0112 

- 0-0100 

- 0-0053 

- 0-0011 

0-0012 

0-0017 

0-0012 

0-0005 

0-0000 

- 0-0002 

- 0-0002 

- 0-0001 

1-8615 

- 0-2545 

- 0-5842 

- 0-2660 

0-0302 

0-1257 

0-0899 

0-0276 

- 0-0087 

- 0-0156 

- 0-0088 

- 0-0013 

0-0020 

0-0019 

0-0008 

0-0000 

- 0-0003 

- 0-0002 

- 0-0001 


Formula for extrapolating auxiliary daUi: 

= j \ Un + r-l +j 2 *^H+r-a + J ^’ 3 Wn + r -3 + • • • (r > 0) 




if 


(d 611) 
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Table of Coefficients kn 


e=001. 

e^O-02. 

e»005. 




0-1570 

0-1769 

0-2076 

0-1482 

0-1644 

0-1873 

0-1254 

0-1329 

0-1397 

0>0948 

0-0928 

0-0842 

0-0628 

0-0536 

0-0359 

0-0341 

0-0216 

0-0027 

0-0117 

- 0-0004 

- 0-0145 

0-0035 

- 0-0125 

- 0-0191 

0-0119 

- 0-0165 

- 0-0161 

0-0148 

- 0-0151 

- 0-0099 

0-0138 

- 0-0110 

- 0.0038 

0-0108 

- 0-0062 

0-0005 

0-0070 

- 0-0021 

0*0027 

0-0034 

0-0008 

0-0032 

0-0005 

0-0024 

0-0026 

0-0014 

0-0028 

0-0016 

0-0023 

0-0025 

0-0006 

0-0025 

0-0018 

- 0-0001 

0-0022 

0-0010 

- 0-0004 

0-0017 

0-0003 

- 0-0005 

0-0010 

- 0-0001 

- 0-0004 

0-0005 

- 0-0004 

- 0*0002 

0-0000 

- 0-0004 

- 0-0001 

• 0-0002 

- 0-0004 

0-0000 

- 0-0004 

- 0-0003 

0-0001 

- 0-0004 

- 0-0002 

0-0001 

• 0-0003 

0-0000 

0*0001 

0-0002 

0-0000 

• •• 

- 0-0001 

0-0001 


• 0-0001 

0-0001 


0-0000 

0-0001 


0-0000 

... 


0-0001 



0-0001 



0-0001 





0-2347 

0-2056 

0-1412 

0-0721 

0-0191 

- 0-0110 

- 0-0211 

- 0-0186 

- 0-0110 

- 0-0035 

0-0014 

0-0034 

0-0033 

0-0022 

0-0009 

0-0000 

- 0-0004 

- 0-0005 

- 0-0004 

- 0-0002 

0-0000 

0-0000 

O-OOOl 

0-0001 


0-2771 

0-2297 

0-1356 

0-0486 

- 0-0046 

- 0-0236 

- 0-0211 

- 0-0108 

- 0-0017 

0-0031 

0-0039 

0-0027 

0-0010 

- 0-0001 

- 0-0005 

- 0-0005 

- 0-0003 

- 0-0001 

0-0001 

0-0001 

0-0000 


0-3601 
0-2604 
0-1045 
0-0023 
- 0-0293 
- 0-0213 
- 0-0056 
0-0032 
0-0044 
0-0023 
0-0003 
- 0-0006 
- 0-0005 
- 0-0002 
0-0000 
0-0001 
I 0-0000 


Formula of graduation: 

u'x = hux + kTiux+i + Ux-i) + k^Ux+a + +.... 

Now each c corresponds, as we have seen, to a definite 
linear combination, and therefore to a definite smoothing 
coefficient. Approximate values are given in the table below. 



Smoothing 

ooefficient. 
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Thus a preliminary inspection of the third differences of 
a set of ungraduated data enables us to estimate the amount 
of smoothing likely to be required, and to select accordingly 
an appropriate value of e. 

165. The Numerical Process of Graduation. — The 

routine to be followed in applying the preceding theory to 
actual data falls therefore into three parts: 

(1) To form the tliird differences of the given data, and 
by inspection to decide what degree of smoothing is necessary. 
To choose from the selected values of e that which most nearly 
produces this degree of smoothing. 

( 2 ) To evaluate in succession four auxiliary data, ttu+i, 
Un+2, t 6 n+ 3 , Urt +4 by the formula 

%»+l =ji^n +i3^n-2 + • • • 

^n +2 +j2^n +j3^n-l + ..&C. 

As a check, the two second differences in the difference 
table of these four added data should be equal. Repeat these 
second differences outwards and build up the table to obtain 
as many auxiliary data as arc required. Reverse the order 
of the data and carry out exactly the same process at the 
other end. 

(3) To graduate the extended set by means of the coeffi¬ 
cients Icn- 

Ex ,— To graduate, by this method, the set of data of § 144^ p. 291. 

We shall suppose that c has been taken to be 0*2.5. 

Then 1 / 46 = 1-5204(1124) +0*1523(1134) -0*3468(1076) -... = 1155*6, 

W47=1*5204(1155*.5) + 0*1523(1124) -0*3468(1134) -... = 1156*0, 

tt« = 1-5204(1156) +0*1523(1155*5)-0*3468(1124) -... = 1142*3, 

W 49 = 1.5204(1142-3)+ 0*1623(1156) - 0*3468(11.55*5) - ... = 1114*4. 

Forming a difference table we find, as we should, that the two second 
differences are equal, each being -14*2. By repeating them outwards 
and building up again we easily find the values* (taken to the nearest 


integer): 




W 54 

^*55 

Usf 


1072 

1016 

946 

861 

762 

649 

522 380 

224 


54 

-130 

^61 

-328 

- 541 

U63 

-767 

W64 

-1008 



* No attempt should be made to intet-pj-ci the auxiliary data in terms o£ the 
rest of the table. They are introduced purely to facilitate the solujiion. 
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Turning now to the other end of the table we obtain in the same 
manner 

Ui9=:424*3t Wjg—439*0j tii7=462»8| tii8=495*7i 
and by a difference table (second differences all 9*1) 

ttis lli4 Wl3 Wi2 ttii Mio t/9 Wg ^7 

538 589 649 718 797 884 981 1087 1201 

tig Us 1/4 Us Us Ui 

1325 1458 1600 1752 1912 2081 

The final process of graduating the extended table, and indeed all 
processes which, like the method of Least Squares and the present method, 
depend on forming a symmetrical linear compound, but are not reducible 
to successive “summation”, are most conveniently carried out when 
arranged as on table facing p. 316. On a sheet of computing paper the 
data to be graduated are entered in a column at equal distances below 
each other, and to the left other columns of strictly equal width are allotted 
to each successive graduating coefficient, which is written at the head of 
its column. The products km,Un. hi each column are computed either by 
arithmometer or, in three place work, by Crelle’s Tables. The graduated 
value u' of any u is then seen to be the sum of the entries in the diagonal 
lines which converge in the Jcq column opposite to that w. 

Part of the whole computing sheet is shown facing p. 316. In summing 
the diagonal entries, distraction of the eye may be avoided by the use of 
a V-shaped stencil, which leaves exposed at any time only the particular 
converging diagonals required. 

Checks during the computation may be provided by summing the 
entries in any completed row, the entry in the column under being 
halved. In the row opposite the sum should be A final check 
is provided by the “Theorems of Conservation” of § 153. Thus in the 
present example we find the three moments to be 

18285, 290109, 5581363 for the ungraduated data, 

18284, 290095, 5581071 for the graduated data. 

A comparison of the graduated and ungraduated data and their third 
differences shows that a satisfactory degree of smoothing has been secured 
without great departure from the original values. 

166. Other Methods. —In addition to the methods of graduation 
which have been described in this chapter, mention should be made of 
a method proposed by E. C. Rhodes, and described in his Tract* on 
Smoothing, to which the reader is referred. A memoir by C. Lanczos, 
“Trigonometric interpolation of empirical and analytical functions”, 
Joum. Math. Phya. 17 (1938) 123, may also be consult^ with advantage. 

♦ No. VI of the Tracts for Computers, edited by K. Pearson; Camb. Univ. 
Press (1921). 
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CHAPTER XII 


COIIRELATION 

157. Definition of Correlation —Consider a definite group 
containing a large number of individuals; let us measure some 
attribute A of the individuals, and let us also measure some 
other attribute B. For instance, the individuals might be all 
the stars of the third magnitude, and A might represent the 
parallax of the star, while B might represent its proper motion; 
or the group might consist of all adult Scotsmen, and A might 
represent the height of a man in inches, while B might repre¬ 
sent his wealth in pounds sterling. Consider now the indi¬ 
viduals in the group for whom A lies between x and x + dx 
while B lies between y and y^-dy \ let the number of such 
individuals be y)drAyf wliere X denotes the total number 
of individuals in the group, or, to express the same thing in 
other words, let <f}{x, y)dxdy denote the prohahility that for an 
individual taken at random the first attribute A lies between 
X and X + dx, while the second attribute lies between y and 
y + dy. 

Now Fermat’s Principle of Conjunctive Probability may be 
stated thus: The probability that two events will both happen is 
hk, where h is the probabil ity that the first event will happen, 
and k is the probability that the second event will happen when 
the first event is known to have happened. Applying tliis to the 
present case, let h =f{x)dx be the probability that for an indi¬ 
vidual taken at random from the group the first attribute A 
lies between x and X’^dxi and let k=ff(x, y)dy be the prob¬ 
ability that for an individual taken at random from those 
members of the group whose attribute A lies between x and 

317 ^ 
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x + dx, the attribute B lies between y and y + dy. Then by the 
Principle of Conjunctive Probability we have 

<#•(«. y) =/{%(«. y)- 

Now here two possibilities present themselves. 

In the first possibility, g{x, y) is a function of y only, not 
involving x. When this is the case, if we divide the original 
group into sub-groups according to the magnitude of the 
attribute A, then the probability that B will lie between y and 
y + dyi^ the same for each of the suh-groups. The two attributes 
A and B are then said to be not correlated, and evidently if>{x, y) 
is eoi^essiUe as the prodvxt of a function of x only, multiplied 
ly a function of y only. This would be the case, approximately 
at least, with the height and wealth of the Scotsmen; for let 
the probability that a man is of a certain height x\jO x + dxhQ 
f(x)dx\ then the probability that his wealth lies between y and 
y + dy pounds is nearly the same for tall men as for short men, 
so may be expressed in the form g{y)dy, where g{y) does not 
involve x\ and the compound probability that his height is 
between x and x + dx while his wealth is between y and y + dy 
is then simply 

f{x)g{y)d^y. 

But in a large class of cases the function y) is not capable 
of being expressed as a function of x multiplied by a function 
of y. In such cases the probability that the first attribute has 
a measure between x and a; + rfoj is not the same for individuals 
with large i/b as for individuals with small y% and the prob¬ 
ability that the second attribute has a measure between y and 
y’¥dy is not the same for individuals with large a?'s as for 
individuals with small x\ In such cases the two attributes 
are said to be con^elated. Thus the parallaxes and the proper 
motions of the stars are correlated; for a star which has a 
large parallax, and is therefore comparatively near to us, is 
more likely to have a large proper motion than a small one. 

Many elementary problems in Probability cannot be solved correctly 
without taking account of correlation. For example, the following: 
^The prolmbility that A can solve a mathematical problem taken at 
random from a certain book is and the probability that B can solve 
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one is also What is the probability that a problem taken at random 
will be solved by one or other or lx>th of them ? 

Here the group consists of all the problems in the book, and the two 
attributes of an individual problem are its solubility by A and its 
solubility by B. These two attributes are correlated, since the problems 
that A can solve will be, to a great extent, the same as the problems that 
B can solve. It would therefore be wrong to assert that the probability 
of both failing to solve a problem taken at random is ^ x J = j, and tliat 
conseciuently the probability that one or other or both would succeed is 
1-Jor 

158. An Example of a Frequency Distribution involving 
Correlation. —As an illustration of correlation, let us consider 
two riflemen side by side firing at targets when a strong wind 
is blowing. The wind will be supposed to affect the shooting 
of both men in much the same way, so that we may expect a 
certain amount of correlation between their records. In this 
case the “ group ” consists of all the records of the two men's 
shots, an individual ” of the group is constituted of a single 
shot of the first man together with the shot fired at the same 
instant by the second man, and the attributes ” of this in¬ 
dividual are the deviations of the two shots. 

Let X denote that part of the deviation of the first man's 
bullet from the mark which is due to causes affecting him 
alone and not affecting the other man, i,e, all causes except the 
wind. Similarly let Y denote that part of the second man’s 
deviation which is due to causes affecting him only; and let 
X + aZ denote the total deviation of the first man's bullet, and 
Y + 5Z denote the total deviation of the second man's bullet, 
where Z is due to the wind. For simplicity, we suppose all the 
deviations to be in a horizontal direction from the mark. We 
assume that X, Y, Z are independent of each other, and that 
each occurs according to the normal law of frequency, so 
the probability that 

X lies between x and x + dx is 
Y „ „ .yandjr + rfy 

z and z + dz is ~e~^dz. 


Z 
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We now want to find the frequency of cases in which 
U «= X + aZ has a value between u and u + du, and V = Y + JZ 
has a value between v and v + dv. 

Suppose Z lies between z and z + dz, which happens in the 


proportion ^eT^’^dz of cases. 


Then in order to produce such 


a pair as is considered^ X must be taken between %i - az and 
+ while Y must be between v-hz and v + dv-lz\ 

the probability of these happening together is 


i/tT 


Therefore the frequency of cases in which U lies between 
u and It, + du, while V lies between v and v + dv, is 

hkl. 


^^^ditdvr 

j-oo 


or 


hM aiL* .t. . (auh*+hvk*)* /•» , «.1 1 I Tlx f- attA*+6j;fc* '^2 

-ydudve’'^ *"*’*' +oW+6*A,«+r* j ^[‘"o*A*+6**;» + J*/ dz, 

TT* 7 — 00 


or 


JihJ . iauh*+VL'k*)* 


Tr(a%a + 6*yy* + /*)* 

If we examine this expression we see that it is of the form 

</.(«, v)dudv=ce~’^‘~^'^+^'’dudv, ( 1 ) 


where e, p, q, s denote constants. The constant s would he 
zero if either of the constants a or & were zero, i.e. if no common 
influence acted on the two riflemen. The correlation is repre¬ 
sented analytically hy the occurrence of this term in uv in the 
exponential. If tins term were absent, the expression (1) could 
be regarded as the product of two factors 

Constant X and Constant x 


of which the first involves u onl 7 and the second involves v 
only, so that in this case (§ 167) there would be no correlation. 

The expression (1) may be regarded as the extension to two 
variables u and v of the normal law of frequency for one 
variable u, 


Xe-''^du-, 
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on this account we shall call it the normal law of frequeney for 
two variailes. The constant c in the expression (1) may be 


determined from the condition that 



^(it, v)dudv = 1; this 


gives 


c = -(pY-s2)i. 

TT 


( 2 ) 


The above is a particular case of the following more general result.* 
Let Wp be variables, such that the probability of Ui 

lying between vi and Vi + dvi is 





Let ajp iCg* • • •> ®’p ^ linear functions of the u% defined by 

equations 

X2 = /C2lU^ + k^7l^ + . . . + 

Xp = kpjUj^ 4“ ^ p 2^2 + • • • + 


and let 




Then the probability that xj lies between ^ and + d^j is 




1 H 1 H 2 • 




where 6jjfc = (- O'. L 2, . . . />) and E = 

while D denotes a determinant of order p taken from the array 


^^11^12 • • • 
^21^22 • • • 


®Pl®P2 • * • ®^p<r> 

and Bj denotes a determinant of order (p - 1) taken from the array 
which is obtained by omitting the jth row in the above array. The 
two determinants Dj and Djt in the product are to be formed from 
the same columns of M. 

169. Bertrand’s Proof of the Normal Law. —It was re¬ 
marked by Rertrand f that the normal law of frequency for two 
variables may be deduced from an assumption resembling the 
Postulate of the Arithmetic Mean, from which, as we have seen 

* Cf. M. J. van Uven, Proc, Amster. Ac, 16 (1914), p. 1124. 
t Comptes Jie7i4fi8f 106 (1888), p. 387. 
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(§ 112), the normal law of frequency for one variable may be 
deduced. 

Consider for definiteness the shots of a rifleman at a target; 
and let the horizontal and vertical deviations of a shot from 
the centre of the target be called A and B. It is found in 
practice that A and B are not independent, but are to some 
extent correlated. 

Let us now make the following assumption, which was 
first proposed by Cotes: that if any number of shots have 
struck the target in points P^, Pg,..., Pn, then the most p7^obahle 
position of the point aimed at is the centroid {centre of gravity) 
of these points. 

Suppose the probability that a shot strikes an element of 
area dxdy at {x, y) is 

F(X - a;, Y - y)dxdy^ 

when X, Y are the co-ordinates of the point aimed at. Then 
if the co-ordinates of the points P^, Pg, . . ., P^ are y^, 
(^ 2 » y^> • • •> Vf) respectively, the product 

F(X-a:i,Y~2^JF(X-a:g, Y-yJ . . . F(X - a^n. Y - y,) 


must be a maximum, when X and Y regarded as variables have 
for values 


^__a!2 + ir2 + . • • + x<f^ 


If then we put 

d\ogY{x,y) 

dx 


n 


- =<^(a=.y). 

the functions (f> and ^ must be such that the equations 




^(“l> ^l) + ^j) + • • • + ^n) = 

lAK. ^l) + ^a) + • • • + ^n) *= 0 J 


( 1 ) 


are- the neoessaiy consequences of 

Oj + . . .+ o„ = 0| 

^i + * • • + /^n“0r 


(2) 
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Therefore 

^(-“*-“8-* • •-%. -Pa~p3-‘ • .-A.) 

+ ^(“»^8) + - • ' + •#>{“«> 

is ideuticollj zero; and therefore differentiating partially with 
respect to og and denoting the partial derivatives of <l>{x, y) with 
respect to x and y by and respectively, we have 

“ ^l( Si* ^2 • • • *“ At) "*■ ^ 2 ) ~ 

so i>i{x, y) is independent of x and y : denote it by a. Similarly 
<l> 2 (x, 2 /) is a constant b, and so 

y)=€u^ + by + c. 

Since equations (1) are consequences of equation (2), we see 
on substituting this value of <l> that c is zero. Thus 

4>{x, y)=-ax + by. 

Similarly ^{x, y) = iCx + Vy, 

where and V are constants. 

Since ^ and ^ are partial derivatives of the same function, 
we must have b = a\ and thus, integrating, 

log ¥{x, y) = + bxy + \Vy^ + constant. 

Therefore F(aj, y) is of the form 
Constant x 

and the normal law of frequency for two variables is thus 
established. 

The study of normal frequency distributions in two and three 
variables was begun by August Bravais in a celebrated memoir, Sur les 
prohabilites des erreurs de situation d!un pointy published in 1846.* 

j&ac .—The displacement of a point is the vector sum of n displacements^ 
and the prohahility that the i^^ of these displacements has a value between 
(xiy yi) and (xi + dxi, yi + dy{^ is 

84 

_Lg - (p^ifCi^+^fisKiyi+yiyi^^xadyi^ 

IT 

where 8^ = a^yf - (t=l, 2, . . w). Show that the probability that 

the total displacement has a value between (x, y) and (x + dxy yH- dy) is 

S* 

- e “ (“**+ 

TT 

• M4m. Sav. tlrang,^ Paris, 9 (1846), p. 265. 
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where ac 


A_ 

'A’ 

defined by the equatione 




C 

A* 


numbers A, Bj C, A 


In 

’ ^ 


B = 


V A 


C=2|!?, A = AC-B* 




(d’Ocagne.) 


160. The More General Law of Frequency. —In § 86 we saw that 
the normal law of frequency for one variable was a special, though 
frequently occurring, case of a more general law of frequency. Denoting 
the probability of a deviation between x and as + da; by </>(a;)da;, then in 
this more general law ^(as) is represented by an infinite series, whose first 
h 

term is -j- e-^***, and whose subsequent terms are obtained from this 

term by differentiation. The normal law corresponds to the case when 
the infinite series reduces to its first term. 

Similarly in the case when there are two variables we may derive a 
more general law of frequency than the normal law, represented by 
y)dxdyj where 

0 / 0 / 02 / 02 / 02 / 

#!,y) = CoA*>y) + «iog + eoi^ + «ai^ + eu^ + Co2^i5+. .. 

where /(», = when the oiigin is suitably chosen, or 

more generally 

/(aj, y) = 

where p, g, «, m, n are constants. 


161. Determination of the Constants in a Normal Fre¬ 
quency Distribution with Two Variables.—Let h be the 
smallest step recognised in measuring x, and let k be the 
smallest step in measuring y\ and let the probability that 
the first attribute A has a measure between x and x-\-h, while 
the second attribute B has a measure between y and y + A, be 
i>{x,y)hk, where (§ 158) 


Let it be required to determine the most probable values of 
the constants p, q, s, a, J, from a set of observations. Let the 
measures of the individuals observed be {x^, yj, y^), . . 

Then the a priori probability that the observations 
will yield these measures is 

JluTm 
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The most probable hypothesis regarding p, q, a, a, i is that 
which makes this quantity a maximum when {x^, y^, {x^, y^,.. 
(*»• yj supposed given. Taking logs, we see that 

n= log - s®) - «)* 

+ 2s2{x^ - a){y^ - 6 ) - q^iy^ - 6 )® 

must be a maximum, and therefore 


®.o S-o ^3-0 5-0 5.0 

On ' 04 ■ 0p dq “■ 0s “• 

The first two of these equations are 

0 = - s^(x^ - a) + - 6 ), 

which, since pq + 5, give at once 

n ^ 


and 


n 


The other three equations are 

Let us denote ^ 2 ( 0 ;^-a)® by o-j®, ^2(yj-&)® by wj®, and 

—i—2(a;i-rt)(yi-&) by r, so that the three numbers o-j, o-g, r 

may be calculated from the observed measures. Then the 
three preceding equations may be written 

Each fraction is evidently equal to 2a a- iJ \ 1 - )’ 

therefore 
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Thus the values of p, q, r are given by 

a_ 1 _ f 1 

^“2(1-»>!*’ *“2(l-?-»)(ria-j’ ^ 2(l-7-*)<r,»-’ 

and replacing j?, g, s by these values in the frequency function 
we see that the prohahility that the first attribute has a measure 
between x and a; + A, while the second attribute has a measure 
between y and y-vh^ is y)hk, where 




27r(rja‘2 >/(l “ r^) 


2(1 


-r2)\ oi* 


(g-o)»_2r(a;-a)(y~!)) . 


^2^ 
(Tja J 


( 1 ) 


and where the comtants a, 6, o-j, o-g, r are eocpressed in terms of 
the observed measures by the equations 


r = r:^2(a^ - a) {y^ -1). 

Wo-jO-g 

These formulae enable us to determine the most probable 
values of the constants of a normal frequency distribution in 
two variables in terms of observed measures. 

Ex.—A man is firing at a target, aiming at its centre. Taking this 
centre as origin, the co-ordinates of the points struck by the bullets in n shots 
are (Sj, • • •, (an, Writing 

+ + . . . + Xn*) = <r^, 

^(yi*+y8*+. • •+yn*)=<^8*, 

^!^yi + a^j + . . . + a„y„)=<ri<rj,r, 
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show that in the long run one-half of the joints struck will lie within the 
ellipse whose equation is 


2(1 


1 /as* ^rxy y*\ 


= 0-69316. 


2 '^2 - 

(Bertrand, G.R, 106 (1888), p. 621.) 


162 . The Frequencies of the Variables taken Singly.— Let 
us now find the probability that the attribute A lies between 
X and X + dx, when the attribute B is ignored, in the normal 
frequency distribution which has been studied. By (1) of the 
last article, the required probability is njlxy where 


_r 

® 27r<ri<r2*y(l-r*)j_a 


_!__ 2Hx-a)(y-h ) (y-h)2 i 

^ (Ti* oicra 


Performing the integration, remembering that 


/: 


we have 


(x^yi 

— 7pr\^ • 


This equation shows that a-^ is the standard deviation for the 
attAhute A, when the attribute B is ignored. Similarly o-g is 
the standard deviation for the attribute B. 

Denoting by y)dxdy the probability that the attribute 
A lies between x and x + dx while the attribute B lies between 
y and y + dy, and denoting by g[x, y)dy the probability that B 
lies between y and y + dy when A is known to be between x 
and X + dx, we have 

Substituting in this equation the known values of and 
wo obtain g(x, y), and thus find that the prohahility that B 
lies between y and y + dy, when A is known to he between x and 
x + dx, is 

1 ^ 2if=7S^V <ri / j 


a,V{27r(l-rT 


dv. 


This is a normal freguenxy distribution about the mean 
} + yyith the standard deviation 
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It follows from this that if (with a great number of observa¬ 
tions) we find the mean of all the measured values of B for 
which the measured value of A lies between cg and x + dcc, and 
if then we plot x, the plotted points will lie on the 

straight line 

Similarly if x^ denotes the mean of all the measured values of 
A for which the measured value of B lies between y and y + dy, 
then Xu plotted against y gives the straight line 


i-h=^Hx^-a). 


These lines were called by Galton lines of regression. 

Ex, 1 .—To find the standard deviation of the difference between the 
measures of the two attributes A and B. 

The probability that (measure of A) — (measure of B) lies between 
a - 6 + a; and a ^h-^x + dx is evidently 

(v+a:)* , 2n»(y+a;) 

2a'<r,( 


dx ■ p 


or 


dx 




STTO'jO’g's/i —r* 




<ri ^ + <r2^~2r<riy2 ^^ 


-Xg-.Ji+rxtrifri 


2(<ri*+(r2*-2r<ri<r2) I dv, 6 2ai*(y2*(l~»’*) V in^i <r2^-2r<rio-2> 


;)* 


dx 


or 


2(oi*+o’g* - Zraiajj) 


V(2’r) + (Tj* - SWiO-j) 

Therefore if a-y denote the standard deviation of (cc ~ y), we have 
<r„2 = o-j2 + o'22-2r(rif72, 


and therefore 




So-jCT-g 


an equation which may be used to determine r.* 

Ex, 2.'j‘ —Shm that the standard deviation of the sum of the measures of 
A and B is 

(o-j* + o-g^ + 2r(r^o-2)i. 

Ex, 3,—Show that the standard deviation of the product of the measures 
of A and B is 

+ aVg® + ^raba-jO-^ + <rjVg2(i + r) 


* Of. TL Pearson, Drapers" Company Research Memovrs^ Biometric Series, 
IV. (1907). 

t Of. B. Pearl, Biomeprika^ 6 (1909), p. 437. 
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163. The Coefficient of Correlation. —We now approach 
the question, How is correlation to be measured ? 

We have seen that in a normal frequency distribution for two 
variables defined by the frequency function 




1 


1 Ux-a)^ 2r(a?-a)(y~&) (y-5)a \ 
2(l-ra)\ «n2 “■ o-iaa + / 


the existence of correlation depends on the presence of the 
term in {x - a) {ij - h) in the exponential, i.e. it depends on the 
coefficient r. When r is zero there is no correlation, since 
<l>(x, y) then factorises into the product of a term depending 
on X only and a term depending on y only. 

Consider the case when there is perfect correlation, i.e. each 
value of the measure of A occurs only in conjunction with a 
particular value of the measure of B, so that one determines 
the other. In this case the standard deviation of B, when A is 
known to have a definite value, must be zero; that is, by the 
last section, - r^) must vanish, and therefore r must have 
the value unity. 

Thus r = 0 corresponds to the absence of correlation, while 
r = 1 corresponds to perfect correlation. It is therefore natural 
to take either r itself or some power of it, such as r® or ^/r, as 
the numerical measure of the correlation between the attributes 
A and B. To decide which power is most suitable,* let us 
recur to the case of the two riflemen. If we suppose, in the 
notation of § 158, that a and h are each unity while h, k, I are 
equal to each other, the frequency function becomes 


A2 

7r^3^ 


8 


'Ih^uv 2h*v* 

3 " 3 

» 


which corresponds to = ^ 2 , r = But in this case 

exactly half of each man’s mean error is due to the common 
element (the wind), and it would seem natural to take the 
measure of correlation to be We therefore decide that t 
itself is the most suitable numerical measure of correlation. It 
is called the coefficient of coiTclation, 

* KapteyD, Monthly Notices B, A, S. 73 (1912), p. 518. 
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Let (xp 1^2)’ * * * ^ ^ great number of measures of the 

attributes A and B; and let points having these co-ordinates be plotted. 
The points will cluster round the mean x = a, y = b; and if we draw 
rectangular axes through this mean point, the majority of the points 
will be found in the first and third of the quadrants formed by these 
axes when r is positive, and in the second and fourth quadrants when 
r is negative. 

164. Alternative Way of computing the Correlation Coefficient. 

—We have seen in § 97 that the standard deviation of a normal 
frequency distribution in a single variable may be found in many 
different ways from given observational material; the same applies to 
the coefficient of correlation in a normal frequency distribution in two 
variables. Thus to obtain the correlation coefficient directly from the 
raw material without attempting to arrange either of the measurements 
in order of increasing magnitude, we may proceed as follows: First 
summing each column, find the mean a of the x*s and the mean b of the 
y’s. Take out all the x’s that exceed a, find their mean A, and find also 
the mean B' of the corresponding y’s. Then 


T, = \/^(A-a), and B'~ 6 = —*(A - a) or B'- 6 = \/^. ro-. 
^52 0*1^ TT * 

nilarly take out all the y’s that exceed 5, find their mean B, and 
3 the mean A' of the corresponding x’s. We have 

A'-a = ^(B-«>) or A'-a=\/|.nrj. 


These equations give <7^, and r. In fact 
’’“B-.6 “ A-a’ 


165. Numerical Examples.— 

Ex. 1.—As a first example we shall consider the results of throws of 
dice made by A. D. Darbishire.* 

Twelve dice were taken, of which m were marked with red, the rest 
being white. All 12 dice were thrown together, and the number of 
dice showing faces with 4 or more pips uppermost in this throw w'as 
noted ; this number will be called the “ First Throw.” 

The red dice were left down and the white dice thrown again. The 
total number of dice (red and white) now showing faces with 4 or more 
pips uppermost was noted; this will be called the “ Second Throw ” 
corresponding to the “ First Throw ” previously made.j’ Evidently there 
will be correlation between the first and second throws. { 


* Mem. Manchester Lit. and Phil. Soc. 51 (1907), No. 16. 
t It may be shown without difficulty that the probability of the case in 
which the first throw is p and the corresponding second throw is q is the 
coefficient of in the expansion of 

(J)W-w(l +y3p)m(l +y)32“»»i(l 

t In fact, r=^. 
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For 500 pairs of throws of 12 dice, of which 6 were marked red and 
were left down and counted again in the second throw, the results were 
as in the following table, a pair of throws of (say) 2 and 6 being entered 
as a unit in the square at the intersection of the thiid row and sixth 
column. 


Second Throws 




0 

1 2 

3 

4 

5 

6 

7 

8 

9 

10 

11 12 Totals 


0 











0 


1 


1 

1 

1 







3 


2 


1 


2 

3 

2 





8 


3 


2 

3 

5 

6 

2 

6 




24 

1 

4 


5 

9 

8 

11 

le 

7 

G 

1 


63 


5 


2 

5 

17 

24 

19 

25 

11 

2 


105 

bd 

H 

6 


1 

5 

14 

25 

24 

24 

17 

4 

3 

117 


7 



2 

2 

13 

16 

27 

12 

4 

2 

78 

s 

8 




2 

7 

13 

22 

14 

5 

3 

66 


9 





3 

5 

6 

9 

5 

2 

30 


10 







2 

1 

2 


5 


11 








1 



1 


12 











0 

Totals 

0 

0 12 

25 

51 

92 

97 

119 

71 

23 

10 

0 0 600 


The means a and h are of course approximately * each equal to 6. 
Let denote the value of the mean of the second throws corresponding 
to the value x of the first throw ; then we have from the above table 


» 1 

! 2 

I ^ 

4 

5 

6 

7 ! 

1 8 9 

! 

10 ' 

1 11 

Vu 3 

j 1 

! 4-6 1 

1 

1 4-9 

5-2 

5*7 

6-1 

6*6 

1 7-0 1 7-5 

80 

! 8-0 


These lie very nearly on the straight line 




Similarly if denotes the value of the mean of the first throws 
corresponding to the value y of the second throw, we find very nearly 


Hence we liave <r,=«r, and r = i, nearly. The value of may bo 
* The computed values of the arithmetic means are a=6*960, i=6-10«. 


SO 






COKRELATION 


333 


Second Throws 



0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


0 












1 




1 


1 






2 






6 

1 




00 

3 

1 

1 

6 

2 

2 

4 

5 





4 


1 

8 

6 

21 

16 

6 

6 




5 


4 

3 

12 

15 

23 

22 

9 

3 

1 

IH 

6 

1 


10 

16 

17 

23 

28 

22 

5 

1 

S 

7 


1 

4 

9 

17 

18 

24 

16 

5 

3 


8 



1 

5 

6 

10 

14 

8 

7 

2 


9 




4 

3 

9 

6 

6 

2 



10 





1 

1 

1 

4 

3 



11 







1 





12 












Find the coefficient of correlation. 

Ex. 3 .—JVhen nine of the 12 dice were marked redy and were left down 
to be counted in the second throWy Darbishire^s results were as follows: 


Second Throws 



0 

1 2 

3 

4 

5 

6 

7 

8 

9 

10 

11 


0 












1 

1 

1 










2 

2 

5 

1 

1 







1 

2 

3 


5 

7 

3 

1 






4 

1 

8 

18 

19 

5 

1 





PS 

ta 

6 


6 

17 

30 

32 

13 

1 




H 

6 


1 

10 

18 

34 

26 

10 

1 



H 

S 

£ 

7 



4 

17 

26 

30 

18 

7 



8 





7 

28 

16 

11 

5 



9 





3 

6 

15 

9 

7 

1 


10 






1 


4 

3 

2 


11 1 1 
12 


Fmd the coefficient of correlation. 

Ex. 4 .—In the following table, which is due to Weldon,* x denotes the 
length of the carapace of the common shrimp, y denotes the length of the post^ 
spinous portion of the carapace, x^ denotes the mean of the values of x 
corresponding to a definite value of y, and denotes the mean of the values 
of y corresponding to a definite value of x. The mean value of x is 249*63, 
its standard deviation is 6*73; the mean value of y is 177*63, and 
its standard deviation is 6*18. 


Proc. R.S. 47 (1890), p. 445; 61 (1892), p. 2. » 
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X, 


Over 260 

188-41 

260 

185*41 

259 

183*25 

258 

182-25 

267 

182*34 

256 

182*22 

255 

181*14 

254 

179-98 

253 

179-60 

252 

179-17 

251 

178-68 

250 

177-71 

249 

177-39 

248 

176-64 

247 

176-36 

246 

175-20 

245 

173-56 

244 

173-31 

243 

173-33 

242 

172-81 

241 

171-30 

240 

169-57 

Under 240 

170-33 


y* 


Over 186 

262*11 

186 

258*25 

185 

256*15 

184 

256*84 

183 

254*88 

182 

254*18 

181 

253*28 

180 

261-73 

179 

251*34 

178 

249*78 

177 

249*10 

176 

248*53 

175 

246*79 

174 

245*73 

173 

245*02 

172 

243*89 

171 

243*67 

170 

241*28 

169 

241*06 

Under 169 

239*88 


From these data show that ^-?=0’67, = 0*975, and consequently 

(Tj roTi 

= 0*83. 


Ex, 5 .*—An urn containing white and black halls is so maintained 
that in drawing a hall the probability of getting a white hall is a constant 
p, and that of getting a black ball is q=l-p. The first drawing of a pair 
is to consist of s balls taken one at a time from the um. The second draw¬ 
ing is to consist of s balls of which t are taken at random from the s first 
dravm, and s — t are drawn one at a time from the um. Show that the 
coefficient of correlation between the number of white balls in the first and 
second drawings of a pair is tjs, 

166. The Coefficient of Correlation for Frequency Distri¬ 
butions which are not Normal. —The theory may be extended 
to frequency distributions which are not normal in the following 
way.f 

' * H. L.<Rietz, Annals of Math, 31 (1920), p. 806. 

/ t Of. 0. V- Yule, JProe. Boy, Soe, 60 (1897), p. 477. 
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Let X and y denote deviations from the means of the 
measures of the two attributes. Let denote as usual the 
mean of all the values of y which are observed in association 
with a given deviation x of the attribute A from its mean. 
Then in the case of normal frequency distributions we know 
(§ 162) that if the values of are plotted against the corre¬ 
sponding values of x, the representative points lie on a straight 

line, namely, y^ * hx, where h has the value 

In the case of non-normal frequency the points will not in 
general lie on a straight line, but let us try to find a constant 
h which will satisfy all the equations 

yM2 “ 

• • • 

as well as possible, when x^, iPg* • • • observed values of 

x^ and //ii 2 i • • • the associated values of yn* We shall 
suppose that to the equation ^ weight is attached, 

equal to the number of observations on which it is based, say 
From these equations of condition we have at once the 
normal equation for h, 

where the summation is over all the distinct values of x. This 
equation is evidently equivalent to the equation 

where the summation is now not over all the distinct values of 
X, but over all the observations, so that the same value of x^ 
occurs times in the sum. 

If as before we write for the mean of x\ for the mean 
of y^, and o-^iT^r for the mean of ay, we see therefore that the 
straight line which best fits the points {x, y^) is the straight 
line 

just as in the case of normal frequency distributions. ^ Similarly 
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the straight line which best fits the points (a'^, y) is the 
straight line 


We may then call the number r defined in this way the 
coeffieient of eorrdation of the two attributes, even though the 
frequency distribution is not normal. 

167. ^e Gorrelation Ratio. —^A more satisfactory method 
of estimating the degree of correlation in a non-normal 
frequency distribution is by means of the eorrdation ratio* 
which is defined in the following way. 

Let the total number of individual observations be N, and 
let N<f)(», y)dxdy be the number for which the attribute A lies 
between x and x + dx, while the attribute B lies between y and 
y + dy. Let Nnje</a:, where 


J — to 


be the number of individuals for which A lies between x and 
x + dx, and let y^ denote the mean value of y for this set, so 
that 

y4>¥>yyy- 

J — to 


There will be high correlation if the y’s of this set are always 
clustered closely around the value yt,, i-e. if the standard devia¬ 
tion of these y’a is always small Denoting this standard devia¬ 
tion by we have 

= f (y- y-) y)dy. 

J-co 

If (Tjj* is to be small for all values of x, its weighted average 
must be small. We shall denote this weighted average by 0^, so 

/ a ro 

n^fdx = / / (y - yi.) y)dxdy. 

-CO y—00 y—00 

Now the standard deviation cr, is given by the formula 

<^2* = f f (y- y)da:dy. 

y-oo y -00 

♦ Of. K. Pearson, “On the Theory of Skew Correlation,” J^rajpers’ Researck 
Mtm, a (19051. 
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If in this we write {(y - y„) + (y„ - &)}* for (y - &)*, we obtain 

<rj® =■ 0= + f f 2(y - yM)(yH - y)<Wy 

/-oo y—oo 

+ r f (Vh - y)<fe<iy. 

y —00 J —00 

Since / (y - yy)4^{x, y)dy is zero, the first of these double 
/—00 

integrals vanishes. So if we define a new number by the 
equation 

f (y»i-i)^<t>(,x,y)dxdy, 

^2 — CO — 00 

the last equation becomes 



Since high correlation is associated with very small values 
of 6, we see that high correlation is associated with values of iq 
nearly eqvuil to unity. If, on the other hand, there is no 
correlation, there is no reason why the mean of the y*s for each 
separate value of x should differ systematically from the mean 
of all the y% and we may therefore expect to be small, 

and consequently 77 to be small. 

The number 17 is called the correlation ratio.* From the 
definition, we have 

^2 •'-00 

SO that 77 * is the weighted average of (^m - W divided by ir^. 

168. Gase of Normal Distributions.—We shall now show 
that when the frequency distribution is normal the correlation 
ratio is identical with the correlation coefficient r. 

For in the case of normal frequency distributions, as we 
have seen (§ 162), we have 

Vu. - & = —(.zj - a), 
where a is the mean of all the x% 

80 v*=^(^*)y %(*- 

* There is, of conrse, a second correlation ratio obtained by interchanging the 
parts played by x apd y throughout. 

(dBU) U 
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Since j - a^dx * 

this gives ly**?*®, which establishes the proposition. 

169. Contingency Methods. —It frequently happens that 
the attributes whose correlation we wish to discover are of 
such a nature that they do not admit of quantitative measure¬ 
ment— e.g, different colours,—and the groups into which they 
are classified cannot be arranged in a sequence possessing a 
logical order. To meet this case, what are known as contingewg 
methods have been devised by K. Pearson.* 

Let A represent any attribute, and let it be classified into 
groups Aj, Aj, . . ., A*; let the total number of individuals 
examined be N, and let the numbers which fall into these 
groups be w, respectively. Then the probability 

u 

of an indiridnal falling into the j^th group is Now let the 

same population he classified according to any other attribute 
into the groups Bg, • • B^, and let the group frequencies 

of the N individuals be m^, m^, . . req)ectively, so that 
the probability of an individual falling into the jth group is 
m 

Then by the theorem of Conjunctive Probability, if the 

two attributes were entirely uncorrelated, the probability of 
an individual falling into the group A^ and also into the group 
n m 

Bj would be “Ifr* number of individuals to he expected 


satisfying these conditions would he which we shall 

N 


denote by Vf^ Let the number actually observed as satisfying 
these conditions be Then the differences in so 

far as they are tqrstematic, represent the correlation of the two 
attributes, and some function of them iray be taken as a 
measure of the correlation. Pearson introduced two of these, 
namely, the root-mean-sqmre contingency ^ defined by the 
equation 




1 ~ ''pg. 

N v. 






* Draperi Co. Re*. Mem,, Biom. Series, i. (1904). 
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and the mean corvlingeney tf/, defined by the equation 

where S' denotes summation over the positive contingencies 
only. 

170. Case of Normal Distributions. —We shall now show 

that when the frequency distribution is normal, the root-mean- 
square contingency <l> is connected with the eorrdation coefficient r 
by the equation 

^•2 

<)l)* = or arc sin r = arc tan 


For when the frequency distribution is normal, taking the 
origin of x and y at the centre of the distribution, we have 
(§ 162) 


-2 = 


ni.. 


so 


and 


N <riV'(27r) ’ N <rgV(2r) 

N 


1 - -*'* 


2ir(rj<r2 

N 




W,0-27(1 - 1 ^) 


_ 1 / 
~ 2(1 -1 


Therefore 


^ _liaL = —^ 

Nv 


27r(rj(r^ 


l+r* a;* 2rxy l+r* ?/* 

1 "2(l-r*; oia'‘'oioriCl-r*)“2(r-r2)‘ orj* 

~\C 


1-7 


o 1 j 

Substituting this expression in </>*= f f dxdy, 

J —CO J — CO ^ 

and remembering that 

r. vfJCT)- 

1 


I 


we have 


.-2 + 1- 


1-r*’ 


which is the required result.* 

* For otlior deductions, cf. W. P. Elderton, Frcguency Cuvves and Corrtdatian 
(London, 1906), p. 148. j 
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171. Maltiple Normal Correlation.— We shall now extend 
the theory to the case when more attributes than two are con¬ 
sidered. For simplicity we shall suppose the number to be 
three, but the formulae admit of an obvious generalisation to 
the case of any number. The frequency distribution will be 
supposed to be normal, so the probability that the attribute a 
has a measure between x and x + dx, while the attribute /8 has 
a measure between y and y + dy, and the attribute y has a 
measure between z and z + dz, is <f>{x, y, z)dxdydz, where 


<#•(«, y.«) = K« 


- (a»>+by*+ 


( 1 ) 


and K, a, h, e, /, y, h denote constants, the origin having been 
taken so that the mean values of x, y, and z are zero. The 
constant £ may be determined at once from the equation 




<#•(». y> z)dxdydz = 1; 


for, remembering that if F(a!i,*„) is a positive 
quadratic form and A its determinant, then 


r» ACO ACO ACO 

/ —oo /-QO y-oo y —a 


— F(a^, 






we have 



where 


A» 


a h g 
h b f 
9 f c 


= abe-^2fgh-ap-bg'^-ch\ 


( 2 ) 

(3) 


By integrating the expression (1) we readily obtain the follow¬ 
ing results: 

The probability that a is between x and x + dx, while )8 is 
between y and y + dy,y being disregarded, is 


1 Ai 

* dMy. 


(4) 


where A, B, ... are the co-factors of a, h, ... in the 
determinant A. < 
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Comparing (4) with the usual form of the frequency function 
for two variables, namely, 

1 - V / g* 2rxy y« \ 

2(l-r*)V^ oi<r2‘^cr2*/^ 


2jr(rj<r27(l - 
H 


'dxdy. 


r = 


. 2 _ 


A 

'2A’ 


'2A' 


The coefficient of correlation of the attributes a and p is therefore 

We shall denote this by Introducing similarly 

the coefficient of correlation r^ between p and y, and rjg 
between a and y, let us consider the determinant 


E = 


'12 


'21 

^31 


^13 

^*23 

1 


and denote by the co-factor of the element in the ^th row 
and jth column. We have 

E = 


E„ = 


1 

H 

v'(AB) 

G 

H 

7(ab) 

1 

F 

7(AC) 

7(BC) 

aA p 

AA 

BC’ “ 

CV{AB)’ 


O 


A2 


VJT I 

,/{rC)'j“ABC’ 
F 

^(BC) 

1 


_ g'A 

^»“B,y(AC)’®'®' 


These equations may be written 

^ ~ gAcr ^ °°° 2Rcrj*«, Rj 2 = 2 hcTjerj/l, etc., 

1 2 '^3 

E„ , E. 


so we have 




= h - -12- php 

2E(ri2' ^'”2Eo-i(r2' ' 


and thus finally expressing the frequency function in terms 
of the correlation coefficients and standard deviations, the 
probability that the attribute a has a measure between x and 
x + dx, while p has a measure between y and y + dy, and y has a 
meagre between z and z + dz, is 

<b{x, y, z) dxdydz, 
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where 

Similarlj in the general case* when there are n variables 
^ 2 ' • • •* standard deviations are ar^, o-g, . . o-^, and 

whose correlation coefficients in pairs are r^y r^, . . the prob¬ 
ability that the first attribute has a measure between and + dx^y 
while the seco 
and so on, is 

where 


while the second attribute has a measure between oig and x^ + dx^t 


0(^i> iCg, • . x^^dxfix>2 

■L 2rV # 


. dx„ 


(27r)^ 


' 1*^2 




RsaJa*.! 

. +251^^+ . . A 

<r^ 


0102 


1 

^12 

^13 • • • ’in 

, and Rp, 

^’si 

1 

’’23 * * • ’*2n 




’’»i3 • • • ^ 



Here B denotes the determinant 


denotes the co-factor of the element in the j>th row and ^th 
column. 

Ex. 1 .—In the case of three attributes, suppose that y is known to have a 
measure between z and Z’\-dz: show that the probability that a has a 
measure between x and x + dx, while /3 has a measure between y and 
y + dy, is 

1 


08 




2Hx^^c^2^/K 

so tliat when the measure of y is known to be z, the mean value of a ie 

and the mean value of B is -^^z. 

^3 .3 

Ex. 2 .—In the case of three attributes, suppose that y is known to have a 

measure between z and z-\-dz, and P to have a measure between y and 
y + dy. Show that the probability that a has a measure between x and 
x + ck is 








o^Ria 


03B11 / . 


dx. 


(22rR)*orj 

so tliat when the measure of y is known to be z^ and the measure of ^ is 
known to be y, the mean value of the measure of a is 


^ . «r,B;7 <r3B„ 

• K. Pearson, Phil. Tfans. 187 A (1896), p. 263; 200 A (1902), p. 1. 
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THE SEARCH FOB PERIODICITIES 

172. Introduction.—In Chapter X. we have been concerned 
with sums of trigonometric terms of the tjpe 

cos {nt + + fltj cos {2nt + e^) + Oj cos {3m< + 63 ) + ... (1) 

As explained in § 132, the vibration of a violin string may be 
represented by a series of this kind when t denotes the time and 
Uj, a,, . . . are certain functions of position on the string, the 
individual terms of the series corresponding to the fundamental 
note of the string and its various overtones. 

It is shown in works on the Theory of Sound that if instead 
of a violin string we consider a bar vibrating laterally {e.g. a 
tuning fork), we obtain for the motion at a definite point of 
the bar a series of the type 

Uj cos J cos + e,) + a, COB {nj, + tj) + . . . (2) 

where are certain functions of position on the 

bar, but where we now no longer have equal to twice or 

equal to three timesin fact, the ratios . . . 

are equal to the ratios of the squares of the roots of the 
equation cos m cosh m + 1 = 0, so that : Wj: tij : . . . =■ 
3*52 . . .: 22*03 . . .: 61*70 The sum of a series 

of the type (2) is evidently not a periodic function of t, 
but we can speak of it as constituted of elements which are 

periodic, the periods being etc. 

In many branches of physical science, especially in meteoro¬ 
logy and astronomy, phenomena are observed which may be 
represented by sums resembling (2); for example, the height of 
sea-water at any instant depends on a number of wn^ituent 
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tides, of which one (the semi-diurnal tide, which is the largest 
constituent tide along the British coasts) has a period of half a 
day, another (the diurnal tide) has a period of a day, another 
(the fortnightly tide) has a period of nearly a fortnight, and so 
on. Each constituent tide produces its own effect independ¬ 
ently of the others, and the actual height of water is the sum 
of these effects. The height of water can therefore be repre¬ 
sented by an expression of the form 


y = + cos (j 4 < + ei)+ 0,008 (nj< + «j)+ . . . + o, cos (n^^ + «,), 


each of the trigonometrical terms corresponding to one of the 
constituent tides. 

In the case of other phenomena, e.y. the spottedness of 
the sun, the variation of the observed quantity appears to 
consist of an accidental or capricious part, which cannot be 
represented by any analytical expression, superposed on a 
systematic part, which the mathematician may attempt to 
represent by an expression of tbe form (2). The area covered 
by spots on the sun certainly fluctuates in a way which 
suggests a certaiu amount of regularity in the variation, 
maxima occurring at intervals of (on the average) rather 
more than eleven years. 

If a series of observations of any quantity are taken, and 
there is reason to expect that they can be represented by a 
sum of trigonometric terms, or that they involve (entangled 
with an irregular variation) a regular variation which can be 
represented by a sum of trigonometric terms, then the first 


2ir 2v 

task of the mathematician is to discovei’ the periods —, —, . . . 

of these! constituent terma This must always be done before 
we attempt to find the amplitudes cq, a,, a,, ... or the phases 
In many cases the periods are known a priori 
from theory or from some reasonable ground of expectation: 
for instance, we should naturally expect the periods of the 
regular terms in the temperature at a given place to be a day 
and a year. But in many other cases, e.g. the spottedness of 
the sun, the periods are quite unknown. "We shall show in the 
present cl^apter how they may be discovered. 
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173. Testing for an Assumed Period. —Let the observed 
measures of the phenomenon, made at equal intervals of time^ 
be denoted by 

• • • ( 1 ) 

and suppose that it is desired to test this sequence for a 
periodicity whose period extends over p consecutive numbers 
of the sequence; might, for instance, mean the number of 
earthquakes in the year x, and we might wish to know whether 
the liability to earthquakes is greater every p years. Let 
denote the remainder when an integer r is divided bv p, so 
that the sequence 

^ 2 * • • • ( 2 ) 

is simply 

0, 1, 2, . . . ip-l), 0, 1, 2, . . . (p-ll 0, 1, 2, . . . (3) 


Then the question '' Does the sequence (1) involve the assumed 
periodicity?” may be expressed more precisely thus: **Does 
correlation eif%st between the sequence (1) and the sequence (2) 

As the frequency distribution with which we are dealing is not 
likely to be normal, the correlation ratio (§ 167) is a better 
method of estimating correlation than the correlation coefficient. 
To find the correlation ratio, we must first arrange the u’s in 
columns, so that all the i^’s which correspond to the same value 
of V are in the same column. This may obviously be done by 
merely writing down the in order in horizontal lines, each of 
which contains p n's thus: 


«0 


7/, 


% 





««p+i 

'*^a^>+2 • • 

• “Sip-l 



^^(w-l )/>+2 • • 


Sums Uq 

Ux 

u . 

Vi' 


All the u'b in the first column correspond to the value zero of 
V, all the u*B in the second column correspond to the value 
unity of v, and ^o on: we have taken enough of the observa¬ 
tional material to fill m horizontal rows, and we have denoted 
the sums of the individual columns by Uq, U^, 

( d 811 ) t2^ 
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Dividing these last numbers by m wo obtain the means Mg, 
My . . of the values of % in the individual columns. 

Then (§ 167) the correlation ratio r/ is tJte standard deviation of 
the M's, divided by the standard deviation of the u’s. The value 
of 1 / is calculated in this way for a large number of values of p, 
and the results plotted as a curve in which p is the abscissa 
and the corresponding value of t) is the ordinate. This curve 
will be called a periodogram.* 

It is easy to see why the ratio of the standard deviation of 
the M’s to the standard deviation of the u’a is a suitable 
indicator of periodicity. For in the comse of one horizontal 
row of the above scheme, the part of the phenomenon which is 
of period will pass through all the phases of one complete 
period, so that this periodic part is in the same phase at all 
terms which are above or below each other in the same vertical 
column, e^. it is in the same phase at the terms u^, Up+^ 

• • •» “^n-iip+r' phenomenon which is of period 

p therefore appears with w-fold amplitude in the row Ug, TJy ..., 
Up_y and therefore appears with its own proper amplitude in 
the row of means Mg, Mj, . . ., Mp-y Any accidental disturb¬ 
ance on the other hand, or any periodic disturbance of period 
different from p, will be enfeebled by the process of forming 
means, since positive and negative deviations will tend to 
annul each other; and therefore, when a periodicity of period 
p exists, the standard deviation of the M’s has a value much 
larger than when a periodicity of this period does not exist 
in the phenomenon. 

174. The Periodogram in the Neighbonrhood of a Tme 
Period. —Suppose now that each of the terms of the sequence 

^TriX/ 

consists of a simple periodic part of period T, say a sin 

together with a part which does not involve this periodicity, 
say 6^, so 

. 27rX , 

* The term periodogram was introduced by Schuster, Terrestrial MougfMtism^ 
8 (1898), p. 24. Schuster’s periodogram differa from tliat introduced above, 
but the similarity of form and purpose is so great that it has seemed best to 
retain the name. 
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Denote by the standard deviation of the b’e, and denote 
by (T the standard deviation of the u’b. Since the standard 

deviation of the sequence 0, sin sin sin is -L, 

11 1 1 

and there is no correlation between and a sin we have 

or* = + cr^*. 

Next, let Uj. denote as before the sum of +. . . 

+ denote the sum • • 

+ ^(m-Dp+*- Then 


. rmrp 

airi ^ 

or = «- - sin y - ] + B,. 

sin -,j/ 

Denote by 2 the standard deviation of the TJ's and by 2 b the 
standard deviation of the B’s. Then in the same way as we 
found cr, we find 


'**'2 — 

- 2^ 


sin*^^ 




Now, since is the sum of m of the ft^'s, we may write 
2 b* = mo-^*, and therefore 

1 Siu« 

2*= V- -~ + 7n,Tb\ 

Thus if 2 m denotes the standard deviation of the means 
Mq, Mj, . . of the individual columns, we have 


. 9 limy 
8in®-7j^ . 

_ 

2m* . m ® 

sm*-^ 
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Therefore, if -q denotes as usual the correlation ratio, we have 

m* —— * 

V - 


sm‘ 


8 winy 


2 m* . , irt) 

Bin*-^ 


m 




This is the equaiim of the pet'iodogi'am, when p and ij are 
taken as rectangular co-ordinates; or to speak more accurately, 
it is the form to which the equation of the periodogram tends 
as the amount of observational material, used in constructing 
it, is increased indefinitely. The number m will, in most cases, 
be taken greater than 20 if there is sufficient observational 
material to provide so many horizontal rows. 

From the above equation it is obvious that with such a 
value of m, is a rather small fraction, except when p is nearly 
equal to T. Let p = T(1 - €), where e is a small number; then 
as e tends to zero, the above value of tends to the value 


1 2 ^ 1 2 

2 m ® 




and as m is a large number, this is nearly 


1 + 


2<r** 


It falls 


away rapidly as e passes away from zero in either direction, 

and when « = ± —, v* becomes 
m 




which is the smallest value it can take for any value of p. 
There are maxima of rj again near the values of p given by 

± that is when e is nearly equal to ± Collect- 
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ing our results, we may say that when the phenomenon studied 
is a simple periodic disturbance of period T, superposed on a 
non-systematic disturbance, and the periodogram is computed 
with a large value of m, the periodogram curve is close to the axis 
of p concept when p is in the neighbourhood of T, where the curve 


2T 

has a peak of h'eadth —, flanked by smaller peaks on both sides. 

The recognition of these peaks in the periodogram is the means 
by which we discover hidden periodicities. 

175. An Example of Periodogram Analysis. —The table 
below gives the magnitude {Le. a measure of the brightness) of 
a variable star at midnight on 600 successive days. (These 
magnitudes were obtained by reading off from a curve, on 
which all the observations of the star’s brightness were plotted: 
they have been reduced to a scale suitable for periodogram 
analysis.) It is required to find a trigonometrical function 
which will represent the magnitude at any time L 


Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

1 

25 

21 

2 

41 

10 

61 

27 

81 

17 

2 

28 

22 

4 

42 

7 

62 

25 

82 

18 

3 

31 

' 23 

8 

43 

5 

63 

24 

83 

19 

4 

32 

24 

11 

44 

3 

64 

21 

84 

19 

5 

33 

25 

15 

45 

3 

65 

19 

85 

19 

6 

33 

26 

19 

> 46 

3 

66 

17 

86 

19 

7 

32 

27 

23 

47 

4 

67 

15 

87 

20 

8 

31 

28 

26 

48 

5 

68 

13 

88 

20 

9 

28 

29 

29 

49 

7 

69 

12 

89 

20 

10 

25 

30 

32 

50 

10 

70 

11 

90 

20 

11 

22 

31 

33 

51 

13 

71 

11 

91 

20 

12 

18 

32 

34 

52 

16 

72 

10 

92 

20 

13 

14 

33 

33 

53 

19 

73 

10 

93 

20 

14 

10 

34 

32 

54 

22 

74 

11 

94 

20 

16 

7 

35 

30 

65 

24 

75 

12 

95 

21 

16 

4 

36 

27 

56 

26 

76 

12 

96 

20 

17 

2 

37 

24 

57 

27 

77 

13 

97 

20 

18 

0 

38 

20 

58 

28 

78 

14 

98 

20 

19 

0 

39 

17 

59 

29 

79 

16 

99 

20 

20 

0 

40 

13 

60 

28 

80 

16 

100 

» 

19 
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Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

101 

18 

142 

12 

183 

19 

224 

15 

265 

27 

102 

17 

143 

16 

184 

15 

225 

15 

260 

29 

103 

16 

144 

19 

185 

12 

226 

16 

267 

30 

104 

15 

145 

23 

186 

9 

227 

17 

268 

30 

105 

13 

146 

27 

187 

7 

228 

17 

269 

30 

106 

12 

147 

30 

188 

5 

229 

17 


29 

107 

11 

148 

32 

189 

4 

230 

17 

271 

27 

108 

10 

149 

33 

190 

4 

231 

18 

272 

25 

109 

9 


34 

]91 

5 

232 

18 

273 

22 

110 

9 


33 

192 

5 

233 

19 

274 

19 

111 

10 

152 

32 

193 

7 


19 

275 

16 

112 

10 

153 

30 

194 

9 


20 

276 

12 

113 

11 

154 

27 

195 

12 

236 


277 

9 

114 

12 

155 

24 
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14 
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21 

278 

6 

116 

14 

156 


197 

17 


21 

279 

4 

116 

16 


16 

193 

20 


22 

280 

2 

117 

19 

158 

12 

199 

22 


22 

281 

1 

118 

21 

159 

9 


24 

241 

22 

282 

1 

119 

24 


5 


25 


22 

283 

2 

120 

25 

161 

3 


26 

243 

22 

284 

4 

121 

27 

162 

1 


27 

244 

21 

285 

7 

122 

28 

163 

0 


27 

245 

20 

286 

10 


29 

164 

0 

205 

26 


19 

287 

14 


29 

165 



25 


17 

288 

17 

125 

28 

166 


207 

24 

248 

16 

289 

21 

126 

27 

167 




249 

14 




25 

168 

9 




12 

291 

29 

128 

23 

169 

13 




11 

292 

31 


20 



211 



9 


Ea 


17 

171 


212 

■CT 


8 

294 

34 

131 

14 

172 

13m 

213 

14 

254 

7 




11 

173 

27 

214 

13 


8 

296 

33 

133 

8 

174 

30 

215 


256 

8 

297 

31 

134 

5 

176 

32 

216 

12 

257 

9 

298 

29 

135 

4 

176 

33 

217 

12 

258 


299 

26 

136 

2 

177 

33 

218 

12 

259 

12 

Kl !■ 

22 

137 

2 

178 

32 

219 

13 


14 

Kill 

19 

138 

2 

179 

31 




17 

Kf J 

14 

139 

4 


28 


13 

262 


K 1 9 

11 

mm 

6 

181 

25 

222 

14 

263 

23 



141 

? 

182 

22 

223 

14 

261 

25 
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Day. 

Mag. 

Day. 

Mag 

306 

2 

347 

25 

307 

1 

348 

25 

308 

0 

349 

25 

309 

1 

350 

24 

310 

2 

351 

24 

311 

5 

352 

22 

312 

7 

353 

21 

313 

11 

354 

19 

314 

15 

355 

18 

315 

19 

356 

17 

316 

22 

357 

16 

317 

25 

358 

15 

318 

28 

359 

15 

319 

30 

360 

14 

320 

32 

361 

14 

321 

32 

362 

14 

322 

32 

363 

14 

323 

31 

364 

14 

324 

29 

365 

14 

325 

26 

366 

14 

326 

23 

367 

14 

327 

1 21 

368 

14 

328 

17 

369 

14 

329 

14 

370 

14 

330 

11 

371 

15 

331 

9 

372 

15 

332 

7 

373 

15 

333 

6 

374 

15 

334 

5 

375 

16 

835 

6 

376 

16 

336 

6 

377 

17 

337 

7 

378 

18 

338 

9 

379 

19 

339 

11 

380 

20 

340 

13 

381 

21 

341 

15 

382 

22 

342 

18 

383 

23 

343 

20 

384 

23 

344 

22 

385 

24 

345 

23 

386 

24 

346 

24 

387 

24 


Day. 

Mag. 

Day. 

Mag. 

388 

23 

429 

5 

389 

22 

430 

8 

390 

21 

431 

12 

391 

19 

432 

15 

392 

17 

433 

19 

393 

15 

434 

23 

394 

13 

435 

27 

395 

11 

436 

30 

396 

9 

437 

32 

397 

7 

438 

34 

398 

6 

439 

34 

399 

6 

440 

34 

400 

6 

441 

32 

401 

7 

442 

30 

402 

8 

4*3 

28 

403 

10 

444 

24 

404 

12 

445 

20 

405 

15 

440 

16 

406 

18 

447 

13 

407 

22 

448 

9 

408 

24 

449 

6 

409 

27 

450 

3 

410 

29 

451 

2 

411 

31 

452 

1 

412 

31 

453 

1 

413 

31 

454 

2 

414 

31 

455 

4 

415 

29 

456 

6 

416 

27 

457 

9 

417 

24 

458 

13 

418 

21 

459 

17 

419 

18 

460 

20 

420 

14 

461 

23 

421 

10 

462 

26 

422 

7 

463 

28 

423 

5 

464 

30 

424 

2 
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31 

425 

1 

466 

31 

426 

0 

407 

31 

427 

1 

468 

29 

428 

2 

469 

27 


D4 


4' 

4' 

4' 

4 ^ 

4' 

4\ 

4\ 

4\ 

4\ 

4 \ 

4i 

4i 

4J 

4J 


49 

49 


49 


50 


50 

50 

50 

50 

50 

50 

50 

51 
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Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

Day. 

Mag. 

-sii 

14 

529 

25 

547 

8 

666 

12 

583 

34 

512 

14 

530 

26 

648 

10 

566 

8 

584 

34 

513 

13 

531 

26 

649 

13 

667 

6 

585 

33 

514 

13 

532 

25 

550 

16 

568 

3 

586 

31 

515 

13 

533 

24 

651 

20 

569 

1 

587 

29 

516 

13 

534 

23 

552 

23 

670 

0 

588 

26 

517 

13 

535 

21 

553 

26 

571 

0 

589 

22 

518 

13 

536 

19 

554 

29 

572 

1 

590 

18 

519 

14 

537 

16 

555 

31 

573 

3 

691 

15 

520 

14 

638 

14 

556 

32 

574 

6 

592 

11 

621 

15 

539 

12 

557 

32 

575 

10 

593 

8 

522 

16 

540 

9 

558 

32 

576 

13 

594 

5 

523 

18 

541 

7 

559 

31 

677 

17 

595 

3 

524 

19 

542 

5 

560 

29 

578 

21 

596 

2 

525 

21 

543 

5 

561 

26 

679 

25 

597 

2 

526 

22 

544 

4 

562 

23 

680 

28 

598 

2 

527 

24 

545 

5 

563 

20 

581 

31 

599 

4 

528 

24 

546 

6 

564 

16 

582 

33 

600 

5 


We have first to find the mean value and standard deviation 
of the observations. By the methods of Chapter VIII. we find 
Mean value = 17, 

Standard deviation = 8-63. 

As there are on the whole about 21 maxima and 21 minima 
in the 600 days, we suspect that one of the most important 
periods will be not far from 600/21 days. We shall therefore 
take ttial periods ranging from 20 days to 32| days; that is, we 
shall give tojp in succession the values 20, 20|, 21, 21^, and so 
on to 32 J. Taking m«=«17, the summation process for, e.y., 24 
days is as follows: 


[Table 







1-H rH <M <M tH 


1-1 <M <M rH 


<M 


00 Tfl rH T—• tH 
tH <M 


COOlOCOC^'OO’^iOtOlOCOCq 
rH T-lC<lC<|rH tHC^Cq 


'Tj<COrHOT-HCvIc0^eOi-HOOC<l»OlOC<IOO 
rHC^G^rH i—l<MCQrH i—iC^i—I 

C^C0<MO05C5rH'^T*<rHJt-Jb-rHC0«0i-Hl0 
rH T—4 rH (M rH rH rH 

OCOCOOCOCOOiO»OOTj4Tj<OJL''Jt^O<M 
rHC^rH ^(MrH r-HC^rH 

OiOiOO^'^0^1r-OC<l<MrH05000sO 
rH<MTH tH(MtH rHrHrH 

O^i>*OC<lG^rH0a000iOTH<MrH050000 
rH CQ rH iH tH tH rH iH i—^ 

C^OO»OTHC<ICOC<100iOirHTi4'rJ4rHJt^r^ 
rH rH rH tH C<1 rH tH tH 

'^COrHOOG^iOlOCSIOOOOC<lJt-lr-CqcO«0 
iHdCqrH TH(MrH tHCQiH 

Jt— Ir^'^OO'^OiOa'^OOOO'^THrH'^COCO 
rHOqCQiH rHC<lrH iHiMOlrH 

OOi0 0505t0C<lCs|i0i^Jr^C0'^CCH-i050 

tH(M<MtH TH<MG^rH tH(MC^tH 

T*4«^r^oaoioocoio5or— oooiocoioiojt^ 

tHC^G^iH rH01C<lrH rH<MC^iH 

OOt^OOO:»OrHOOOJt^J>-asC^^ 05 ^JOCi 

THCl(MrHrHrH(MC^C^iH rHC^lC^C^lrH 


C^OOaOSrH’^'^rHl^-lr^rHCOOrHlOlOrH 

CQCOC^rHrHrHC^CO<MrHtHrHC^COG^THTH 

LO Cl 00 00 dCOCOdO^CSd'Tj^TjHCO 

ClCOCSIrHTHiH<MCO<MiHrHiHClCOClTHrH 

OOCOr^lr^COOOCOlOlO'^ClrHClcO'^^O I 
ClCOClrHTHClCOSOClrHrHClCOCOClTHrH 

rHT*4COCOiOCOClCO;^iOCOlOCOClCq;^£^ | 

COCOC^rHrHClCOCOClTHrHClCOCOClrHrH 

ClC0'^iO^>O<?0dCl*^ JC^ ^ ■rft O O Tt* ^ I 

eOCO^rHrHClCOCOClrHrHClCCCOC^IrHTH 
COCOClTHiHClCOCOClrHrHClCOClrHTHCl | 

cooaoacooooocoJr^Jt^cooocoio»^T*4^ i 
COC^i—liHrHC^CQClrHrHClCQCOClrHrHCl j 

CSi CO CO Cl Oi 05 Cl ^ ^ CO rH O rH d CO CO I 

COdrHrHrHClCOClrHrHClCOCOClTHrHCl j 

rHCOCOClOOiOrHdCOClOOiOai—I 
COdiHrHdClCOClrHTHdCOClrHrHrHCl j 

0005OrHO00l^l>-05g^O»0^05^^ I 

d^H^rHClClClrH iHddClrH rHd 

CQtH SdClClrH iHClClClrH iH^j 


285 319 353 371 389 406 407 408 392 376 359 326 294 259 242 208 191 174 173 172 188 204 238 254 
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The same at the foot of the columns are the numbers Ug, U^, 
U^ . . . corresponding to this trial period. 

When the trial period is not a whole number of days, we 
modify the arrangement slightly so ns to secure that terms in 
the same phase are still in the same vertical column: thus if 
the trial period were 31J days, we should write the values 
corresponding to days 1 to 31 in the first horizontal row, and 
the values corresponding to days 32 to 62 in the second 
horizontal row, then we should omit altogether the value 
corresponding to day 63 in order to bring the value correspond¬ 
ing to day 64 to the beginning of the third row, and so on. 

The table of values of the u’a obtained by the summation 
process with the different trial periods is as follows: 


[Table 
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We have next to find the standard deviation of each of 
these columns and divide it by m («17) in order to obtain the 
standard deviation of the corresponding column of means 

• • • • 

Now dividing these standard deviations of the M’s by the 
standard deviation of the u% which was found to be 8* 63, we 
have a table of values of the correlation'ratio -q corresponding 
to the different values of the trial period p. The values of ij 
may evidently be obtained at once by dividing the correspond¬ 
ing standard deviations of the columns by 146‘71 (= 17 x 8-63). 
The results are as follow: 


Period. | 

Arithmetic | 
Mean. | 

Standard 

Deviation. 


Period. 

Arithmetic 

Mean. 

standard 

Deviation. 


20 

291-4 

4-042 

0-028 

26j 

294-4 

16-837 

0-115 

20J 

293-5 

10-171 

0-069 

27 

290-3 

22-132 

0-151 

21 

294-6 

10-018 

0-068 

27| 

293-6 

6-590 

0-045 


293-7 

7-947 

0-054 

28 

293-9 

58-197 

0-397 

22 

295 

17-034 

0-116 

28| 

292-2 

109-019 

0-743 


293 

6-772 

0-046 

29 

292-3 

126-481 

0-862 

23 

294-9 

31-630 

0-216 

29J 

291 

106-089 

0-723 

23j 

294-7 

69-389 

0-473 

30 

292-6 

61-587 

0-420 

24 

291-2 

83-856 

0-572 

30j 

291-9 

15-948 

0-109 

24J 

294-3 

62-567 

0*426 

31“ 

292 

17-481 

0-119 

25 

293 

23-198 

0-158 

314 

293-2 

29-034 

0-198 

25J 

290 

7-697 

0-052 

32 

291-5 

16-972 

0-116 

26 

294-8 

23-561 

0-161 

324 

291 

4-020 

0-027 


These values of ij, plotted against the corresponding values of 
p, give the periodogram shown on p. 357. 

In the practice of periodogram analysis, since saving of labour is 
more important than great accuracy, it is not unusual to omit altogether 
the calculation of the standard deviations, merely plotting the periodo¬ 
gram from points obtmned as follows. As abscissa take p, and as ordinate 
take the difference between the greatest and least numbers of the sequence 
Ug, Uj, Ug, . ■ i this difference is called the oscillation correspond¬ 

ing to the trial period p. The table on p. 358 gives the oscillations 
corresponding to the various values of p, between 19 days and 33^ days, 
the number m being taken to be 17 throughout. 
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Trial Period. 

Oscillation of Sum. 

Trial Period. 

Oscillation of Sum. 

Days. 


Days. 


19 

30 

26| 

20 

191 

36 

26^ 

50 

20 

16 

27 

63 

20i 

32 

27J 

36 

21 

30 

27i 

22 

211 

27 

28 

169 

22 

51 

28i 

305 

22i 

35 

29 

360 

22} 

20 

29^ 

303 

23 

89 

30 

176 

23i 

204 

30| 

65 

24 

236 

31 

49 

24^ 

177 

31J 

86 

25 

74 

32 

53 

25} 

20 

32^ 

16 

25} 

23 

33 

68 

26 

25 

33i 

49 


It will be seen on plotting the oscillation against p that the curve so 
obtained closely resembles the periodogram obtained by the more accurate 
and laborious method of computing the standard deviations. 

It will be seen that the periodogram in our example shows 
two high peaks at = 24 and p — 29, with the side-peaks belong¬ 
ing to these as in a diffraction-pattern in optics. We are 
therefore led to infer the existence of two constituent oscilla¬ 
tions, one of which has a period of approximately 24 days and 
the other of approximately 29 daya In order to find these 
periods more exactly, we repeat the work so far as concerns the 
neighbourhocd of p = 24 and p = 29, but taking a larger value 
of m —say about twice as great—and also taking values of p 
separated from each other by smaller intervala Thus we might 
now calculate the correlation ratios corresponding to ^==23-6, 
23*8, 24*0, 24*2, 24*4, when 34 horizontal lines are taken. 
This will give a much better defined peak in the neighbourhood 
of 27» 24; the peak will in fact be only half as broad as in the 
previous periodogram. 

It may be remarked that if two periods are found (from an 
inspection of the first periodogram) to be so close together that 
the peaks correspondiog to them run into each other, it will in 
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any case be necessary to repeat the work with a larger value of 
m, in order to diminish the breadth of each peak and so bring 
the two peaks clear of each other. This is analogous to the 
corresponding device in spectroscopy, of employing a grating 
with a larger number of rulings in order to resolve two lines 
which are not distinctly separated by a smaller instrument. 

In order to get the periods still more exactly, we must study 
the phase of the constituent oscillations (as found by Fourier’s 
analysis) at different epochs; for if there is a slight error in 
the assumed period, the consequence will be that the phases 
of the oscillation, as determined from different " stretches ” of 
material, will not fit together accurately—the oscillation will 
appear as if it were continually being accelerated or retarded 
in phase; an improved value for the period is then suggested 
by the amount of acceleration or retardation of phase. In 
this way we find that in the present example the periods 
are exactly 24 and 29 days. We then add together the two 
oscillations {ix. write down the numbers M^, Mj,... for 
the first oscillation in one horizontal line, and write down the 
numbers Mj, Mg,... for the second oscillation in a hori¬ 
zontal line below them, and add) and subtract the result from 
the given observed values, in order to see what is left still 
unaccounted for. Thus 


Bay 

1 

2 

3 

4 

5 

G 

7 

8 

9 

2^-day term 

16-8 

18-8 

20-8 

21-8 

22.9 

23-9 

23.9 

24 

23.1 

29-rfay term 

25-7 

26-7 

27-8 

27.7 

27-7 

26-6 

25.4 

24.2 

22.1 

Sum 

42-5 

45*5 

48-6 

49.5 

50-6 

50.5 

49.3 

48-2 

452 

Given in graph 

25 

28 

31 

32 

33 

33 

32 

31 

28 

Biff. 

17-5 

17*5 

17-6 

17.5 

17 .C 

17.5 

17.3 

17-2 

17.2 

Bay 

10 

11 

12 

13 

14 

15 

16 

17 

18 

7A-day term 

22.1 

21.1 

19-2 

17.3 

15-2 

14.2 

12-2 

11.2 

10-2 

2^-day term 

19-9 

17.8 

15-6 

13.5 

11.4 

9.5 

8.4 

7-5 

6.6 

Sum 

42-0 

38.9 

34.8 

30.8 

26-6 

23.7 

20.6 

18.7 

16.8 

Given in graph 

25 

22 

18 

14 

10 

7 

4 

2 

0 


17-0 

16-9 

16.8 

16.8 

16.6 

I 6.7 

16.6 

I 6.7 

16.8 
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Day 

19 

20 

21 

22 

23 

24 

25 

26 

27 

24 day term 

10-2 

101 

IM 

12 

14 

14.9 

16.8 

18.8 

20-8 

29-day term 

6-8 

6.9 

8-1 

9-2 

11.4 

13.6 

16.7 

17-7 

19.8 

Sum 

17-0 

17-0 

19-2 

21.2 

25.4 

28.5 

32.5 

36.5 

40.6 

Given in grapl 

i 0 

0 

2 

4 

8 

11 

15 

19 

23 

Diff. 

17-0 

17-0 

17-2 

17*2 

17*4 

17.6 

17-5 

17.6 

17-6 

Day 

28 

29 

30 

31 

32 

33 

34 

35 

36 

2^day term 

21-8 

22-9 

23-9 

23.9 

24 

23.1 

22.1 

21.1 

19.2 

29-day term 

21-7 

23-7 

25-7 

26.7 

27.8 

27-7 

27.7 

26.6 

25.4 

Sum 

43.5 

46-6 

49-6 

50.6 

61.8 

60.8 

49.8 

47.7 

44.6 

Giveningraph 

26 

29 

32 

33 

34 

33 

32 

30 

27 

Diff. 

17-5 

17*6 

17-6 

17-6 

17*8 

00 

17.8 

17*7 

17-6 


The numbers in the last line are nearly constant, the 
deviations from constancy being no more than might be 
expected from the inaccuracy of the numbers taken to repre¬ 
sent the 24-day and 29-day terms; so the given observations 
may be cuxounted for as the resultant of two constituent oscilla¬ 
tions of periods 24 and 29 days respectivdy, together with a 
constant term. We therefore write 


, a 2ir< 
tt, = a + /3008-^ + 


. ^irt 2‘irt 

y Sin ^-f 8 COS-gx + e Sin 


2mt 
24 ’ 


the constant a must have the value 17, since this is the mean 
value of Uf, and the constants p, y, 8, e may be determined by 
least squares. The final result is 


-17 + 10 sin + 7 sin 


where t denotes the time in days. 


176. Bibliographical Note. —Methods for the discovery of hidden 
periodicities have been given by many writers, beginning with Lagrange ♦ 
in 1772 and 1778. Lagi‘ange’s method, which has been improved by 
Dale,*!* is quite different from the method described above. 

In the latter half of the nineteenth century attention was given 

♦ CEuweSt 6, p. 606 ; 7, p. 686. 

t JEbnthly Not, lt,A»S, 74 (1914), p. 628. Carse and Shearer, Edin. Math» 
TraOa, No. 4, p. 41. 
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chiefly to methods which depend on the principle that (in the notation of 
§ 173) the sequence U^, . . •, preserves any periodicity of period 
•p that may be present in the observations, and does not preserve other 
periodicities. Methods of this kind were originated by Buys-BaUot* * * § 
and developed by Stracheyj* and by Stewart and Dodgson.^ Stokes § 
suggested that in order to test an observed function u(a;) for a period 

u(x) cos nxdx might be calculated; 

if a true periodicity of v{x) is represented by the term c sin (u'x + a)^ 
then when n is near n' the integrals will involve terms 


the integrals sin nxdx and f 


which are of lai^e amplitude and long period, and are therefore readily 
detected. 

Schuster discussed the matter in a number of important memoirs,f| 
in which the periodogram was introduced. Let a function u(x) of the 
time X take the values Uq, Up Un_i at equidistant values of the 

time Xq + a, Xq + 2a, . . ., + (n - l)a. 

_ ^ ^TTC 

Let A « ^ Mg cos —, 

P 


n-l 

B= S Mg sin 

gs=0 


27rs 

T’ 


and let 


S = 


(A2-fBg)a 

n 


Then the value of S in the neighbourhood of a particular value pa 
was defined by Schuster to be the ordinate of the periodogram for 
that period. It was remarked by Craig H that Schuster’s formulae were 
equivalent to those arrived at in finding the correlation coefficient r 
between the sequence m^, Mp . . ., and the sequences 

2v 47r fiTT 

1, cos —, cos —, cos —, .... 

’ p p p 

, . 27r , 47r , Qw 

and 0, sin —, sin —sin y, .... 

* Les changemeiUs piriodtqties de temp&ature, Utrecht (1847), p. 34. 

t Proc. RS. 26 (1877), p. 249. 

t Proc. JLS. 29 (1879), p. 106. 

§ Proc. n.S. 29 (1879), pp. 122, 303. 

11 Terrestrial Maejnefismt 3 (1898), p. 13; Camh, Phil. Trans. 18 (1900), 
p. 107 ; Proc. R.S. *77 (1906), p. 136 ; Phil. Trans. 206 (1906), p. 69. 

IT BrU. Ass. Rep. 1912^ p. 416. 
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H. H. Turner * * * § has published tables for facilitating the computations 
of Schuster’s process. 

A , method depending on the formation of difference equations has 
been suggested by Oppenheim,t and mechanical methods have been 
described by A. £. Douglass $ and W. L. Balls.§ 

The reader who wishes to pursue further the subject of this chapter 
is recommended to consult a valuable memoir by J. Bartels, “ Random 
fluctuations, persistence, and quasi-persistence in geophysical and cosmical 
periodicities ”, Terrestrial Mag. 40 (1935), pp. 1-60, and a memoir by T. £. 
Sterne and L. Campbell, “Properties of the light curve of XX» 

Annals of harvard Coll. Ohs. 90 , No. 6 (1934). Of. also Dodd, “J’eriodogram 

analysis with the phase a chance variable ”, Ecanometrica 7 (1939), p. 57; 
G. T. Walker, “ Period-hunting in practice ”, Quart. J. B. Met. 8oc. 67 
(1941), p. 15; K. Stumpff, Qrundlagen u. Methoden d. Periodenforschung 
(Berlin, 1937)« 

* Tables for Facilitating the Use of Harmonic Analysis^ by H. H. Turner 
(Oxford University Press, 1913). 

t Wien 8iUungeber. 118 (2a) (1909), p. 823; cf. F. Hopfner, ibid. 119 (2o) 
(1910), p. 361. 

t A^ophysical J. 40 (1914), p. 326; 41 (1915), p. 173. 

§ Proc. R.8. 99 (1921), p. 283. 



CHAPTER XIV 

THB NUMERICAL SOLUTION OF DIFFERENTIAL EQUATIONS 


177. Theory of the Method. —The best method of inte¬ 
grating differential equations numerically is one devised by 
J. C. Adams;* it is applicable to equations of any order, but 
for simplicity we shall describe its application to equations of 
the first order. 

Let the differential equation be 

( 1 ) 

with the initial condition that y is to have the value when 
X has the value Xq. Let Xq, ... be a sequence of 

values of x at equal intervals w apart; we shall denote the 
corresponding values of y by y^, y^, 7 / 3 , . . . and the corre¬ 
sponding values of by q^, 2 ^ 3 , .... The differences 

(^1 “ yo)> {//2 “ denoted by Ay^, Ay^, etc., as usual. 

It will generally be found convenient to choose the interval w 
so small that differences of order above the fourth (or, better 
still, differences of order above the third) may be neglected. 
The value y^ being given, the problem is to determine 

Vv Vv ?h . 

The first four of these values are determined in the following 
way: by differentiating equation ( 1 ) we have 




( 2 ) 


* Theories of Capillary Action, by F. Bashforth and J. G. Adams: Cambridge, 
1883. The method has been since developed in Russian memoirs by A. KrilofF. 
G. Stdrmer, Comptes rendus du cong. ini., Strasbourg, 1920, develops a similar 
method for equations of the second order. For more recent developments cf. 
R. V. Mises, Zeitschr.f. angew. Maih, 10 (1930), p. 81; G. Schulz, ibid. 12 (1932), 
p. 44; D. R. Hartree, MancheOer Lit. and Phil. 80 c. JHem. 77 (1932), p. 91; 
V. M. Falkner, PhU. Mag. 21 (1936), p. 624. 
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which exprcBsee 


da? 


in terms of x and y. 


By differentiating 


(2) we can similarly obtain ^ in terms of x and y, and so on. 


In this way .all the successive derivatives of y with respect to 
X are obtained as known functions of x and y. Thus in Taylor’s 
expansion 



(x-XnY (<Py\ 

2 ! \<mJ 


+ 

0 


.. ( 3 ) 


the quantities y^, 


. are all known, and there¬ 


fore y can be found for any value of x near to x^ By sub¬ 
stituting w, 2w, 3w, 4w for (x - x^) in equation (3) we obtain 
the values of y^, y^ y^, y^ with as great accuracy as may be 
desired; or alternatively, we may compute . . . from 

the equations 



and then calculate y,, y,. y^ by building up the difference 
table of the y’s. Then from either the equation 

or the equation 



we obtain the values of y,, y,, y^. 

Now with these computed values we form a difference table 
thus; • 
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«• 

A}. 

A®g. A^q. 

AV 

% 




7i 





Aji 

A*?o 


at 


A'g-i 

A*?o 


Ajj 



a% 

A?8 



a* 




The further process 

consists in extending 

this table by 

adjoining new 

(sloping) lines. To effect this, we remark that 

to the symbolic formula 




*■ /i 

)’' = 1 + ^AE-i + '‘^j2^We-» + . . . 


there corresponds the interpolation formula (the backward form 
of the Gregory-Newton formula) 

+ rw) = 2 „ + + -^-r^A^q „.2 + -—- 'A’^qn-a 

(4, 

Now yn+i-y«=£ 

1 /Cr«+w 

= 1 / qdx 

= / q{Xn-\‘Tw)dr, 

Jo 

Substituting the value of q{oCn-\’rw) from (4) and performing 
the integrations with respect to r, we have 

1 5 3 

yn+l - yn = 2n + 2 + j 2 ^V -2 + 


251 A4 . /RN 

+ ^A«?„-4 + . •• (5) 


and in particular 


1. 6... 3., 251.4 

ys - ^4 = ?4 + 2 ^^9 12 8 ^ 720 ^ 
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Every term on the right-hand side of this equation is known, 
so by meams of it we can compute y^. The equation' 

enables us now to compute and so to adjoin a new sloping 
line to the difference table of the 

Next we put in equation (5) and use this equation to 
compute^,; then from the equation 

we compute q^ and so obtain another sloping line in the 
difference table. 

Adams’s process consists simply in the repetition of this 
operation. It may evidently be extended to any number of 
simultaneous equations each of the first order, such as the pair 

and so to any system of ordinary differential equations 

Ex,—Given the differential equation 

dy y’-x 
dx’^y^-x 

with the initial values Xq^O, y^^ly tabulate the solution from 
to as s 0*2. 

We have by successive differentiation 

yy* -y + x=z0, 

+1 = 0 , 

3^y' + y2/'" + y" + a:y'" = 0, 

3y"a + + 2y'" = 0, 

I0y”y*** + 5y + yy^ + + 3i^'^ = 0, 

lOy'"* + + 6yy + yy^^ + xy^^ + 4yv ^ q, 

whence the initial values are 

Vo =-1, %" - - 2, = 8, » eo, V = 640, = - 8840, 

and hence the Taylor series is 

, « 4 . 5 ^ 16 ^ 221 ^ 

y-l+a;*-a;*+gat»-^+—+ . . . 


with 


80 221 

y'sssl - 2a5 + 4a!^- 10a?+-;r-a!*—^ 
o o 
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Talcing it; = 0 * 02 , we have from these aeries 


1-000000, 
1/1=1-019610, 
1/2= 1-038479, 
2/3=1-056669, 
2/4=1-074*196. 


53 = 0 - 02 , 

51 = 0-019230, 

52 = 0-018516, 

53 = 0-017851, 
54=0-017228. 


The remaining values 2 / 5 , I/ 7 , . • • are computed from a difference 

fable of the q% using equation ( 6 ); the corresponding values 5 ^, 5 ^, , , , 

being given by the formula q^ = w{y„ — + a;„). We thus arrive 

at the following results: 


0^=0 

2/3=1-000000 

90 = 0-02 

A. 

A2. 

A». 

A^. 

-770 




aCj = 0-02 

2/1=1-019610 

21 = 0019230 

-714 

56 

-7 


a;2=0*04 

^2= 1*038479 

22 = 0-018516 

-665 

49 

-7 

0 

flfgs 0-06 

2/3=1-056659 

23 = 0-017851 

-623 

42 

-5 

2 

a;^=0-08 

2/4=1-074195 

24=0-017228 

-586 

37 

-3 

2 

0:3 = 0-10 

2/5=1-091126 

25=0-016642 

-552 

34 

-5 

-2 

a-3 = 0-12 

2/3=1-107490 

25=0-016090 

- 523 

29 

- 1 

4 


2/7=1-123317 

2, = 0-015567 


28 


-3 

-495 


-4 


a;g=0-16 

2/8=1-138632 

28=0-016072 

-471 

24 



0:3 = 0-18 

2/3=1-153469 

28 = 0-014601 






a;i3 = 0-20 1/10= 1-1^7842 

The above value of 1/10 is correct to the last digit, 

178. Bibliographical Note. —Of the other methods which have been 
proposed for integrating differential equations, the best known is that of 
Bunge, Math, Ann, 46 (1895), p. 167, improved and extended by Kutta, 
Zeits, /. Math, u, Phya, 46 (1^1), p. 435. See also Lindel 6 f, Acta Soc, 8c, 
Fenn, 2 (1938), No. 13. 

A method which permits the determination of an upper limit to the 
error involved has been described by Steffensen, Sdrtryck ur Skandifiaviak 
Aktuarietidskrift^ 1 ^ 22 , p. 20. 

On the numerical solution of partial differential equations, cf. Gorakh 
Prasad, PhiL Mag. 9 (1930), 1074. A valuable monograph on the 
numerical integration of differential equations, both ordinaiy and partial, 
written by A. A. Bennett, W. E. Milne and H. Bateman, was published by 
the National Research Council, Washington, D.C., in 1933. 



CHAPTER XV 


SOME FUBTHEB PROBLEMS 


In this chapter we shall give a brief treatment of various 
topics which cannot be discussed more fully here on account 
of limitations of space. 

179. The Summation of Slowly-Convergent Series.— 

Many of the commonest series of Analysis converge very 
slowly. Thus with Brouncker’s series for log, 2, 


log, 2 = 


1.2 3.4 5.6 7.8 9.10 


a hundred terms are required to give the sum accurately to 
two digits; 10,000 terms are required to give it accurately to 
four digits; and 1,000,000,000 terms are required to give it 
accurately to nine digits. 

Stirling,* in 1730, showed how a series of this kind may be . 
transformed into one which is rapidly convergent. 

His method is to expand the general term of the given 
series as a series of inverse factorials; thus the general term 

of Brouncker's series is i> f. • • • 

and this may be expanded as a series of inverse f^torials in 
the form 

1 1 1.3 

*“ 2%:(« +1) ^ 2*te(a! + l){sc + 2) ^ 2^(aj + !)(« + 2)(« + 3) 

1.3.6 

■‘■2^(« + 1)(!B + 2)(® + 3)(»+4)''’' • * 

Now form the sum + + + - • • The sums 

* MeOh. Diff. (1780), Prop. II. Ex. 5. 

^ ' 868 
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arising from the individual inverse factorials can each be 
summed by the well-known algebraical formula, and thus we 
obtain 

., _ 1 , i , i.1.3 

«a + “»+i + '*‘»+t + - • • 2'hi'^ 2*x(x+l)'^ 2*x{x+l)(a!+2) 

2'^(a;+l)(a;+2)(a: + 3) ' ‘ 

In this formula put 13^. Thus 

111 1 i ^.1.3 

27.28 29.30'^ 31.32 ~ 2M/. 2».V-V 

i.3.6 

4- _*-L 

^ 05 2 7 2 9 3 1 8 » ^ • • • 


The series on the right is rapidly convergent and yields the 
sum 0*018861219 . . .; and the sum of the first thirteen terms 


of Brouncker's series • 


to be 0*674285961 . , 
for the sum to infinity of Brouncker’s series 


. + 25 ^ found by addition 
Adding these sums, we have finally 


log^ 2 = 0*693147180. 

Stirling’s method was extended by himself {Meth, Diff, (1730), 
Prop. III.) to the case of series whose nth term involves an nth power. 
Another method was given by Kummer in Journal fiir Math, 16 (1837), 
p. 206. See further Catalan, Mem. Belg. cour. 33 (1865) ; Markoff^ 
M^m. de St-PSt. (7) 37 (1890) ; Andoyer, Btdl. de la Soc, Math, de 
France, 33 (1905), p. 36 ; and Bockwinkel, Nieuw Archief voor Wiskunde 
(2) 13 (1921), p. 383. 

180. Prony’s Method of Interpolation by Exponentials.*— 

We shall now show how a function k[x), which is specified by a 
table of numerical values, may be represented approximately by 
a sum of exponentials 

K(x)^VeP^ + Qle^^-\‘Ber^ + . . . + 

where P, Q, E, . . V, p, q,r,..,,v are constants which are 
chosen so as to give the closest possible representation of the 
given numerical values. Let the given values of k{x) be 
'fQ* '^ 2 * • • • corresponding respectively to the values 

0, w, 2w, 3w, ... of the argument x. 

* A. L. Prony, Jour, de V£e. Pol, Cah. 2, (an IV.), p. 2P. 


(d8U) 


18 
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If ic(a;) could be represented emctly as a sum of 
exponentials, saj 

Pei» + Qe4» + Ee'» + . . . + Ve«*, 

then k(x) would satisfy a linear difference equation of the Tonn 
Aic«4.,4 + B/c„+^_l + C»c,4.,,_g + . . . + M#c„ = 0, * 

where che roots of the algebraic equation 

A®** + B®'*■* + . . . + M = 0 

would be e^, «•“. Prony's method, which is based on 

this fact, is to write down a set of linear equations 

Ak^ + BK|^_j + C(C|,_ J.+. . . + Mk0 = O, 

Ak^+i + Bk,, +C»c,,_i + . . . + Micj = 0, 

A*^+j + Bk|„^i + Cic^ +. . . + Sdxg=0, 


(where the quantities kq, k^, k,, Kg, . . . are known, since k(®) is 
a known tabulated function), and by the ordinary method of 
Least Squares to find the values of A, B, C, . . ., M which 
best satisfy these equations; then with these values of 
A, B, C, . . ., M to form the algebraic equation 

A®'‘ + B®'‘'* + . . . + M = 0 

and find its roots; these roots will be «*", . . ., e^, and 
thus p, q, . . V are determined. Knowing p, q, . . ., v, we 
have a set of linear equations to determine the coefficients 
P, Q, . . ., V, and these also, are to be solved by the method 
of Least Squares. 


Ex. 1,—If from (he data 


X 

10 

20 

30 

40 

50 

60 

7(K 

80 

90 

k(x) 

6460 

6090 

5642 1 

5049 

4417 

3623 

2401 

983 

142 


we repreeent «(«) tn the form 

Po» + Q)8» + Ry», 

prove that the beet vahiee o/ a, j8, y are the root* of the eubie 

sf - 3-02923i«* + 3-78779* - 1-68664 = 0. 

(W. S. B. Woolhonse.) 
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Ex, S.—From the data 


X 

0 

8 

1 « 

24 

32 40 

k{x) 

248 

345 

421 

481 , 

1 5291569 



x(ii!) = 629-3763 X 1-0014438» 

- 381-3767 X 0-9670412® 

+ 0-00046064 X 1-2361997* 


(F. Selling.) 


181. Interpolation Formulae for Functions of Two Argu¬ 
ments.* —The formulae of interpolation for functions of a single 
argument, which have been established in Chapters I.-III., may 
be extended to functions of two or more independent argu¬ 
ments. Thus we can introduce divided differences (cf. § 11) for 
a function f{x, y) of two arguments by the definition 


/«.M >y) 

and obtain a formula analogous to Newton’s formula for unequal 
intervals (§ 13), namely, 

My)=/oo(®.y) 

=/oo(®i. ^i) + (® - y) + (y - yiVii(®i. y) 

=/to(®i. yi) + (® - ®i)/io(®. yi) + (y - yi)/oi(®. y) 

=/oo(®i. yi) + (® - ®i14o{® 2. yi) + (® - ®i)(® - ®2.'14(®. yi) 

+ (y - yil/®i(®i. y.) + (® - ®i)(y - yi)Ai(®. y) 

+ (y-yi)(y-y»]l/i«(-®i.y). 

etc. 

From this all the interpolation formulae for a function of two 
arguments may he derived. Thus the polynomial of the second 
degree, which at the places 

(®i.yi) (®i.y.) (®i.y8) 

(®i.yi) (®*.y 2 ) 

(®8.yi) 

♦ Cf. K. Pearson, Tracts for Computers, No. III. (Cambridge, 1920) *, 
completed (by the discussion of some omitted oases) on pp. x et seq, of the 
Introduction to Tables of the I'ncomplete Gamma Function \ S. Narumi, T^ohu 
Math^Joum, 18 (1920), p. 309. ^ 
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takes the values 


respectively, is 


Vn Va Va 
Va Vn 


. ( 111 + ^21 +_ VZ1_ _\ 

\(ai. — x^) (x. ~^xA i.T^ — Waj^ - x^ ix^ — aJi) (a/3 -■ a?2 )' 


. (» - a:j)(aj - a!j) 

^ ( _?iii_l._ ’?M . ^ Va _ 

M»’i-®3)(yi-ya) (»i-*8)(.ya-yi) (®a-*i^(yi-ya) 


( . ^11 


^;i2 


%5. 


, - V.i) 


(y»-yi)Lvt-y^ b/a-?h)iy9-y^^ 

• {y-yi){y-ys)- 


In particular, taking =>= 0, x^ = y^- - 1, we 

have the formula 

/(®j y) =A 0 + i®(/i, 0 ~/-i, o) + iy(/o, 1 -/o, -i) 

+ i®*(/i, 0 - 2 /o, 0 +/-1. o) + ®y(/o. 0 -/o, 1 -/i, 0 +/i, i) 
+ iy^i/o, 1 “ 2/o,0 +/o, -i). 

which is the best for general use when x and y are positive and 
less than 

Simikurly we may determine * the polynomial of degree m 
which at the \{in + l)(m + 2) places 

K. yi) (a^ y^ • • ■ {«>m> yi) («m+i. ^i) 

yJ (aw ya) • • • (»«.» y») 

• •• • 

(ai. ym+l) 


takes the values 

^11 VtL • • • Vml Vm+l»l 

Vl2 ^22 • * • ’/toS 

• • • • • 

Vh m+V 

* pf. 0. Biermann, JfonaMtfle/Ur Math. 14 (1903), p. 211. 
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Considering now the case when the entries are given at 
equal intervals ^ of x and equal intervals r/ pf y, and introducing 
the notation 


Kf=A‘^>y+v) -A^>y)> 

^y+ 1 /) -/(». y+n) -A^+1 y) +/(*> y). 

etc., 

the above generalisation of Newton’s formula for unequal 
intervals becomes 

'=/(®. y) + «A,/+ mnA*f 

mn{n-l) w(n-l)( w-2) 

^ 2 ^ ^ 2 3 ^ • 

This, which may be regarded as the extension of the Gregory- 
Newton formula to functions of two arguments, is due to 
Lambert.* Symbolically it may be written 

/(* + y + wj,) = (1 + Af)’"(l + A,)'yi[a:, y). 

It is sometimes practicable and advantageous to reduce interpolation 
of functions of two arguments to linear intei'polation. j* For example, 
if y) is tabulated for quinary values of x and y, the u corresponding 
to any integral values of x and y may, by a proper choice of origin, be 
reduced to the form u{±t, ± a), where t and 8 take one of the values 
1 and 2. The value, of u can be found, by interpolation along the line 
of values of the form w( + 5i, + 5«): for example, u(—1, 2) may be 
found by linear interpolation along the line of values . . . w(-6, 10), 
w{0, 0), w(5, - 10) . . . 

Ex. 1. —Complete the accom.panying table on the assumption that third- 
order differences are everywhere zerOy and express the tabulated function as a 
polynomial in two variables. 

* Beytrdge^ Part III. Cf. Lagrange, Nouv, Mim. de Berlin (1772), reprinted 
(Euvres^ 8, p. 441. 

t Elderton, Biometrika, 6 (1908), p. 94; Spencer, J.LA. 40, p. 299; Burn 
and Brown, Elements of Finite Differences^ §§ 148-164. ^ 
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X 

= 0 

1 

2 

3 

0 

3 

5 

9 


1 

2 

. i 

3 



2 

3 




3 






[Answer^ M = 3 + a;-2y + Jc^“a!i/ + y®. ] 


Ex, 2 .—The following table gives the time (in hours, minutes, and secondi^ 
corresponding to certain altitudes (a) of the sun in various declinations 
CLt a place in a certain latitude. 




a = 10* 

1 


14* 



18* 

] 


22* 


8 = 20* 

6^ 

lim 

26* 

6* 

60’" 

17* 

5^ 

29’" 

27* 

6^ 

8’" 

48* 

16* 

5 

66 

41 

6 

35 

5 

6'' 

14 

39 

4 

64 

17 

10- 

5 

40 

16 

5 

19 

66 

4 

69 

37 

4 

39 

17 

5* 

1 6 

24 

60 

6 

4 

30 

4 

44 

4 

4 

23 

29 

0* 


9 

6 

4 

48 

29 

4 

27 

39 

4 

6 

28 


. Find the time corresponding fo a= 16*, 8= 12*. 

[Answer, 5^ 15»» 50*.] 

Find also the time corresponding to a=5 20*, 8= 14*. 

[Answer, 6* 1»« 30»] 

Ex. 3.—The construction of isobars on meteorological charts essentially 
involves inverse interpolation with two arguments. The ordinary con¬ 
struction assumes tliat, within small intervals, the barometric pressure is 
a linear function of the co-ordinates of the place. A more accurate con¬ 
struction due to Thiele [Tidsshr.^^^ 4 (18V3), p. 87], which is specially 
useful near maxima and minima, is thia Three neighbouring places of 
observation are connected by lines, and on these the points arc found 
wh^re, according to the ordinary method, there would be a certain baro¬ 
metric pressure u. The line joining any two of these points cuts the 
circumscribing circle of the triangle in points which lie on the isobar 
belon^ng to the pressure u. 

Sh^ that ITUelds construction may be derived from the assumption that 
the barometric pressure may be expressed by a function of the form 

u=sa + bx + cy + d(x^ + y^. 

182. The Numerical Computation of Double Integrals.— 
It is easj to construct formulae for the evaluation of double 
integrals on the same principle as the Newton>CoteB and Gauss 
formulae of single integration (§§ 76,80). Thus when differences 
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of the fourth order (in the two variables combined) are neglected, 
we have (by a double application of Simpson’s formula) 

y)dx =» ^(6 - a)(d - c) I /[a, c) +/(<?, d) +yi(6, c) 


This formula indeed is true even when some of the differences 
of orders 4 to 9 are not negligible, e.g. A^*A * A/A^, 

A «A >. A »A *, A *A ». 

When differences of the sixth order are n fleeted, we have 
Burnside’s formula,* 

/'. £ /<*■ - s{/(n/b’ “) V b) 

-s/^o) ^/(o- -Jm)]* i§{/(Vs’ 


This is exact so loDg as /{oc, y) is a polynomial of degree not 
exceeding 5. 

A formula similar to Burnside’s may be obtained by a double applica¬ 
tion of Gauss’s three-term (fifthMiifterenc^ formula. This also is valid 
even when many of the higher differences are not negligible. 

It is often advantageous to break up the field of the double integration 
into sections, and apply a formula to each section separately. 

For other formulae cf. W. F. Sheppard, Proc, Lond, Math Soc, 31 
(1899), p. 486 ; 32 (1900), p. 272; A. C. Aitken and G. L. Frewin, Proc. 
Edin, Math Soc. 42 (1923-4); M.S^owski,Amer.ifa/A.J/on/^^ 47 (1940). 


Ex. 1. —Show that the value of 


ff 


dxdy 




computed by 


Bumndds formula^ 0 6641. 

[The true value is 0*6638 to 4 digite."] 


Ex, 2 .—Show that the value 



computed hy 


Bumeidds formula, is 0*9262. 

[The true value is 0-9202 to 4 digits.] 

♦ Mess, of Maths. (2) 87 (1908), p. 166. 
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183. The Numerical Solution of Integral Equations— 
In recent years Int^ral Equations have proved to be of.great 
importance in Applied Mathematics; thus it was by the 
numerical solution of an integral equation 'that Knott was 
enabled in 1919 * to deduce the forms of the seismic rays in 
the earth’s interior by a rigorous mathematical method from 
the observational data of earthquakes. 

Some types of integral equation which occur in Applied 
Mathematics are as follows: 

(L) Aht^s Original Eqwtwn .—This is 

(0<p<i.A0)‘=0) ( 1 ) 


where €f,(x) is the unknown function which is to be determined, 
and /(x) is a given function. The solution of this equation, which 
was given by Abel himself,f is 


sin^jr 

XT 




(x-s)^-P' 


( 2 ) 


When /(») is given, the values of/'(«) and of this integral may 
be obtained without difficulty by the ordinary processes of 
interpolation and numerical integration. 

(ii.) Integral Equations of Aid’s Type. —These, which may 
be regarded as a generalisation of ( 1 ), are of the form 

jf <#.(«)«(»-s)ds=./(a!), (3) 


where k(x) is a given function called the nucleus, f[x) is also a 
given function, and ft>(x) is the unknown function which is to be 
determined. We need only consider the case when the nucleus 

* Pfoc, Jt.S.E, 89 (1919), p. 157. It was H. Bateman, Phil. Mag. (6), 19 
(1910), p. 576, who discovered the integral equation. 

t CBuvres (ed. 1881), p. 11 (1823) and p. 97 (1826). The fundamental 
meaning of AbeTs result is most clearly seen if the integrals which occur in it 
are interpreted as in the theory of generalised differentiation; if ^(a;) is written 
for r(l ~j9)^(a;), AbeTs formula reduces to the simple statement that if 


then 
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k{x) becomes infinite at x = 0, for in the simpler case, when 
the nucleus is finite at = the equation can be reduced 
immediately (by differentiating it) ta Poisson’s type (iii) and 
dealt with by the methods appropriate to that type. We may 
then suppose k{x) to be such that xPk{x) is finite and not zero 
at = 0, where p lies between 0 and 1; and for the purposes of 
numerical integration we can represent it, by the methods 
of Chapters J.-III., with as great a degree of accuracy as may 
be desired, by an analytical expression of the form 

K{x)=x-‘^{aQ + a^x + a^^ + . . . + a^x^). 


Then * the solution of the integral equation (3) is 

^(x) = f f(s)L(x - s)ds, 

^ Jo 


where 




F(/8) 




and where o, j8, . . .,v are the roots of the algebraic equation 
F(a;) = (i^'* +(I - p)aja;"-^+ (1 {2-p)aja:*-® + . . . 

+ (l-i?)(2-^7) . . . 


and where y^ix) denotes the Incomplete Gamma Function 
7^(a;) = e^j^ sP'h'^^ds, 


This may be regarded as a direct extension of Abel’s original 
formula (2), which may be derived from it by taking n = 0. It 
expresses the solution of the integral equation in a finite form 
in terms of the Incomplete Gamma Function, of which tables 
have been published. | 

(iii.) Integral Equations of Poisson's Type, —These are of the 
form 

<i>{x) + \ <I>{s)k{x - s)ds =/(i»), (4) 

Jo 


where k{x) and/('»■) are given functions, and <l>{x) is the unknown 
function which is to be determined. Several different methods 
for the numerical integration of this equation are given in 

♦ Whittaker, Proc, Ji,S, 94 (1918), p. 367. 
t K. Pearson, Tables of the Jneowplete Oamma Fundion (1^2). 

(dSU) * 
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.Whittaker^s memoir of 1918,* and hare been applied to bhcf 
solution of certain problems In viscous fluid motion by Havelock.f 
We shall here indicate the most useful of them. 

Since the nucleus k{x) is supposed to be specified by a table 
of numerical values over the range of values of x considered, 
we may apply Prony’s method of interpolation by exponentials 
in order to represent it analytically in the form of a sum of 
[t exponentials 

ic(aj)-Pe»“ + Qc9* + . . . + Ve«», (5) 

where (P, Q, . . ., V, p, q, . . ., v) are constants which are 
chosen so as to give the closest possible representation of the 
given numerical values. Taking then this form (5) for the 
nucleus k{x), we shall show that the integral equation (4) may 
be satisfied by a solution of the form 

^(®) =/(®) - j^K(a: - (6) 

where the solving function K(;r) is also a sum of p exponentials, 
say 

K(a:) = A«" + B«^ + Cei* + . . . + Nc’*. (7) 

To prove this we remark first that certain existence-theorems 
established by Volterra justify us in assuming for the solution 
the form (6), where K(a!) is now the function to be determined. 
In (6) put k(x) foT/(x): thus 

^(x) - k(x) - j^K(x - s)K{8)d8, 

which gives the value of <l>{x) corresponding to this value 

of/(a;). 

Putting (® - s) for s in the integral, we have 
i>{x) = k(x) — y*K(s)it(® — 8)d8. 

Comparing this with the integral equation (4), after replacing 
/((b) by k{x) in the latter, we have 

<f>{x)’MK{x), 

* Loe. eit. 

t Mag. «S (1921), pp. 620, 628. 



SOME FUBTHEE PEOBLEMS 379 

and therefore the pair of functions 

</>(a:) = K(a!), f{x)=‘K{x) 
satisfy the integral equation; that is to say, 

K(a!) + J^K{8)k{x - s)ds = k{x). (8) 

In this equation substitute the value (5) for k(x) and the 
value (7) for K(»). Thus we have 

Ae*' + B6^ + CeT* + . . . + N«’« 

+1 {Ae" + Be^*+Ccv»+. , . + 

{Pcp*-p« + Q64*-«» + . . . + Ve**-"}(fs 
= PeP* + Q««» + . . . + Ve®*. 

Equating coefficients of e“* on the two sides of the equation, 
we have 

J-+- 9 _+...+JL + i.o 

a—jPa-f/ a — V * 

and similarly equating coefficients of 

■^—+7^ + ' • *+ 0 -— + 1 = 0 , 

fi-p fi-q p-v 

and so on; and therefore o, p,y,. . v are the roots of the 
algebraic equation in x, 

^X^l.O. (9, 

x-p x-q x-v ' ' 

This enables us to determine a, y, . . . . 

Next, equating coefficients of on the two sides of the 
equation, we have 


Similarly 


A B N , .I 

a-p p-p v-p I 


A B N , „ 

a-g P-q v-q f 


A B N , ,, 

-T o-H . . . H-h 1 *= 0 

a — p — V v-V 


( 10 ) 
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Since (a, p, y, . . v) and {p, q,r,.. v) are known, these 
equations (10) enable us to determine A, B, . . N; and we 
see that if the constants (a, p,y,». v) and (A, B, . . N) 
are determined by equations (9) and (10), the equation (8) is 
satisfied by the value (7) of K(a?). 

The value of K(x) may be obtained in a more explicit form 
in the following manner. If we eliminate A, B, . . N 
determinantally from the equations (7) and (10), we have 


K(*) 




1 

1 

1 

= - 


1 

1 


p-p 


1 

p-p ■ 


a p 

’ ■ V — p 



v-p 

1 

1 

1 


1 

1- 

1 

a-2 

P-q 

v-g 


P-q ■ 

v-q 

1 

1 

1 


1 

1 

1 

a-^V 

P-v 

* v — V 


X 

)8-® 

'' V -V 



1 . 

1 

a~p 

v-p 

1 

1 

1 

a-q 

X 

• • v-q 

1 

• 

1 . 

1 

a — 

* * V — V 


The determinants which occur in this equation are of the 
kind known as alternants, and may be factorised by known 
methods.* Performing the factorisation, we have 


K(») 


= _ («-y)(«»-g)(a-^‘) • • • 


(«-/J)(a-y) 

{P-p)W-q)... (P-v) 




• (“-•') 

(v-p){v-q){v-r). . .(vP) 

{v-a){v-P) . . . iy-n) 


Combining our results, we have the following theorem: 
jTAa solution of the integral equation 


<f>(l>l) + ^<I>{8}k(x - 8)ds -/(»), 


<where the nucleus k{x) is supposed to be given numerically and 

* The eyaluatioD of alternants of this type is due to Cauchy, Exercices 
d^analyse, 2 (1841), p. 151. * 

t 
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to hjive teen expressed approximaidy ty Prony's method in the 
form 

»c(a!) = Pei« + Qe«* + . , . + Ve«*, 

^ '^(®) “>(») “ 
where 


Kfa?)- -_ (^-p)(^-g)...(^-«)^ _ 

{a-P){a-y)...\a-v) (fi-a){$-y) .. .(fi - v) 

{v-p){v-q) . • . (•'-V), 


and where a, y, . 
in x, 

r 


{v-a){v~l3) . . . {v-p) ’ 

V are the roots of the algebraic equation 


Q 


X -p X - q 


* A 

x-r 


V , 

.•f-+1 = 


= 0 . 


The solution of the integral equation is thus obtained in a 
finite form which admits of computation. 

See also Prasad, Proc. Edin, Math, Soc. 42 (1924) 46. 


(iv.) Integral Equations of FredholrrCs arid Hilbert's Type ,—^The 
numerical solution of integral equations of the type 

— X /* K{xy s)<l>(s)ds = f(x). 


where K{Xf s) and f{x) are given functions, X is a constant, and ^(a;) is the 
unknown function, has been discussed by Bateman, Proc. R.8. 100 
(1921), p. 441, who gives references to the literature of the question. 
See also F. Tricomi, Lincei Rend. 83i (1924), p. 483, 882 (1924), p. 26, and 
Nystrom, Acta. Math. 84 (1930) 185. 


184. The Bayleigh-Bitz Method for Minimum Problems.— 

A great many problems in mathematical physics involve the 
determination of an unknown function to satisfy the condition 
that a given integral has a minimum value. Suppose for simplicity 
that there is only one independent variable and let the unknown 
function y{x) be required to be such as to make 



dy dN 

fT. 4 / ^ _ 2 . 

’ dx'da? 



( 1 ) 


a TYiiniTTrmiTi j and also possibly satisfy certain further conditions. 
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To solve this problem, Rayleigh in 1870* and subsequent 
years f devised ibe foUovnng method, which, was afterwards 
elaborated in a celebrated memoir by W. Ritz.j; 

Let the unknown function be expanded as an infinite series of 
known functions with arbitrary coefficients (such as a Fourier 
series), say 

y(x) = a^^x) + + Og^g(®) + ... (2) 


where ^Jix), ^^{x), ^^(x),... are functions each of which satisfies 
all the further conditions (e.g. boundary-conditions) which are 
imposed on the function ^(x). Then evidently a theoretical pro¬ 
cess might be imagined in which the function ^x) and its deriva- 
itives are replaced in (1) by their values as furnished by (2), so 
that the integral J becomes a known function of the infinite set 
of coefficients Og, Oi, Og,..and the problem would then become, 
to determine Og, Oi, Og,... so as to make J a minimum. The 
validity of such a process would of course need a careful exam¬ 
ination; but Rayleigh’s proposal was to obtain an approximate 
solution by truncating the series (2) so as to retain only a finite 
number of coefficients Og, o^,..., a„; then substituting the trun¬ 
cated series in (1), J is obtained as a function of Og, 01 , 02 ,..., On, 
and the values of Og, Oi,..., o„ can be determined from the con¬ 
ditions that J should be a minimum, namely 


9«o 



• • •> 


00 * 


= 0 . 


Thus the approximate value of ^(x) is determined. 

The same principle can be applied when J is' a multiple in¬ 
tegral taken over a domain of any number of independent vari¬ 
ables.! 


* In finding the ooireotion for the open end of an organ-pipe, Phil, Trana, 
161 (1870), p. 77. 

t Many examples are to be found in Rayleigh’s Theory of Sound, e.g. §§ 88, 
89, 90, 91, 182, 209, 210, 266; also PhU. Mag. 47 (1899), p. 656. 
t J.fUr Maih. 135 (1908), p. 1. 

§ Cf. N. Kryloff, Lea mModea de aoliUion approchde dea probUmea de la phyaique 
fnaJlhdfiuUiqm (Paris, Ganthier-Villars, 1931). G. Temple and W. G. Bickley, 
Sayleigh'a Priwiple and ita Applicaiiona to Engineering (l^ndon, Oxford U. 
Pr^ 1933). H. M. James, ** ^me applications of the Rayleigh-Ritz method 
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185. Application to the Determination of Eigenvalues.— As 
an illustration of the Rayleigh-Ritz method, we shall solve the 
following problem. 

The difierential eq^uation 

S+)w=o ' a) 

(where A is independent of x) admits a solution which vanishes 
both when a = 1 and when x = — 1 , only if A has one of a cer¬ 
tain set of special values.* These values of A are called proper 
values^ or characteristic mhies, or aiUomlveSy or eigenvalues. We 
shall now show how they may be calculated numerically. 

The differential equation ( 1 ) arises in the Calculus of Varia¬ 
tions as the condition that the Integral 

should have a stationary value. 

Let us take, as an approximation to the polynomial 

y = (1 — a;2)(ao + OjO?), (3) 

which satisfies the condition of vanishing when a; = 1 and when 
X = — 1 , and which contains two undetermined coefficients 
and cq. Substituting from (3) in ( 2 ), we have 

J = [{2(«i — «o)® ~ — A{1 — »®)*(Oo + Oia!®)®]da;. 

A 

Performing the integration, this gives 

^ 8ao® , 16a„e»i , 88ai* ^/16oo* i 820901 , 16o,*^ 

“ T' “15' 105 V 15 106 316 / 

to the theory of the structure of matter ”, Bull. Amer. Math. 8oc. 47 (1941), p. 
869. Many other references are given by H. Bateman on pp. 91-94 of Bulletin No. 
92 of the National Research Council (Washington, 1933). 

* Actually the values of A in question are ...»the corresponding 

solutions y(x) being cos sin ir*. cos ; but we do not assume this 

2 c 

knowledge in the test. 
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The conditions that J should have a stationary value, namely 
s— = 0 and s— = 0, are 

OCIq 

«.(•-¥)+ 1 ( 1 -¥)=»- 

Eliminating the ratio Og : o^, we have A® — 28A + 63 = 0, so 

A = 14 ± Vm 
= 2-467,437 or 25-632,663. 

These are the approximations to the eigenvalues furnished by 
the Bayleigh-Bitz method when the approxinmtion is made by a 
polynomial of the form -(3). The correct values of the first and 
third * eigenvalues of the difierential equation (1) are actually 

A = 2-467,401 and A = 22-207. 

Thus the Bayleigh-Bitz method, with the choice (3) for the 
unknown function, gives an excellent approximation to the 
lowest eigenvalue (correct to 5 significant figures) and a very 
rough approximation to the third eigenvalue. If we had chosen 
instead of (3) a polynomial of higher degree, we could, of course, 
have obtained better approximations to the lowest eigenvalues 
and also rough approximations to higher eigenvalues. 

* The approximation (3), sinoe it contains only even powers of g, yields only 
eigenvalues of odd order. 

[Copies of (he OomptUalion Sheets facing pages iS70 and ^8 may he 
obtained direct from the Pvblishera in quantities of not less (han one dozen^ 
at (he price of 2s. 6d. per dozen sets. Special terms wiU he quoted for orders 
of one (housand and upwards.^ 
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Chaptbh I 


Page. 

16 


Example. 


a , 3ato — 213 ^ aw^ — 2Bw + 2y _ — 3yw + 68 

_a4_j- Cl - 1^2 j. • --A- X 

Aw 6w Aw 6m* 

+ const 


4. 


(«) +1) f. . (» +p) 

/ . / «W jJJr\ 

(6) cos na; = ( 2 sm — 1 cos (jw + ^ 'iy 


6 . 3 »(a:-l)(x- 2 )+ 10 ii(a:—l) + 6 a:+ 1 . 

6 . 6 . 

7 . -a:»-3x*-6x+l. 

8 . 0-776737,4947. 

11, 0-781072,8886. 

^12. 9-414213,562374. 

7 13. 875-311046,687. 

14. /(l-25) = 0-208459, /(l-75) = 0-227977. 

15, (a) 0-433178,483028,782. 

(5) 0-433265,874297,833. 

16. 9-64861 lj469336,24. 

17, 0-861232,11. 


Chapteb II 

33 1 . /(a, 5 ) = - : f[n, h c) = (ab + he + ca)laVA ^; 

fifl, b, e,d)= - (abc + abd + acd 6 c(i)/a» 6 *c*<i*. 

3. /(3)=.18,/!:i4> = 2548. 

4. (a)!e* + !t. (bjar. 

5. / 681472. 

6 . J 130326. 

7. • /(9) = 810 : /(6-6)= 316-876. 

8 . (6)!K*-f6. (7)®*-fx*. 

886 > 
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Page. Baampla.' 

Chapter III 

51 1. 0-433702,769624,48. . 

2. 7-671672,8900. 

3. 0-433265,874297,832. 

4. 3-528273,8. 

Chapter IV 

68 1. 2-602060,0. 2-607455,1. 

2-603144,4. 2-608626,1. 

2-604226,1. 2-609594,4. 

2-606306,1. 2-610660,2. 

2-606381,4. 2-611723,3. 

a 8387. 

4. (a) 88294. (6) 84640. 

6. fc=886, i) = 24-51. 

7. j>= 14-23, d=100*-56. 

8. 1-213411,6. 
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(0) 
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(10) 

(11) 
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Ex.1. 

(a) 

62-02 
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-10-46 

1-54 

-0-17 

-1-06 

2-21 

2-66 
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-0-80 

1-08 



17-88 

8-80 

2-47 

-4-08 

-0-82 

6-88 


2-06 

1-30 

0-04 



Ex. 2. 













(aj 

111-88 

-60-41 

-86-20 

-12-18 

-6-06 

EtSiVl 

-0-08 

-8-86 

-1-63 

EHj 

0-38 

-0-28 

0-26 

(6) 


0-28 

6-61 

-7-51 

-8-01 

-1-10 


-4-58 

8-08 

0-83 

1-60 

0-90^ 


Ez.8. 


-68-16 

18-85 

0-48 

1 

-15-581 

-2-16 


1-46 

-1-17 

4-08 


-0-64 

0-71 

U 


12-26 

-8-67 

-2-18 

-4-01 I 

-9-05 

4-26 

-6-18 

BliJ 


0-82 

0-78 


Ex. 4. 
(«) 

118-26 

25-04 

22-77 

-1-87 

-5-88 

-0-70 

-1-88 

6-58 

-0-63 


8-67 

-2-26 

1-25 

Q>) 


68-83 

-82-70 

7-16 

1-08 

-6-40 

-0-60 

-1-82 

-7-68 

2-50 

-0-47 

8-58 



Chaptkr XII 


Page. Sxam^. 

333 2. r«>0-25. 

3. r=0-76. 
















INDEX OF NAMES 


(Tht numhera refer to tht ^ages) 


Abel, N. H., 376. 

Ackland, T. G., 286. 

Adams, J. 0., 363. 

Aitken, A. 0., 300, 308, 375. 
Ampere, A. M., 20. 

Andoyer, H., 369. 

Baer, W. S., 228. 

Baillaud, B., 283. 

Balls, W. L.. 362. 

Barlow, P., 94. 

Bartels, J., 362. 

Bashforth, F., 363. 

Bateman, H., 367, 376, 381, 382. 
Bayes, T., 219. 

Bennett, A. A., 367. 

Berger, A., 163. 

Bernoulli, D., 98, 112, 124. 
Bernoulli, James, 138, 141. 
Bernstein, F., 228. 

Bertrand, J., 224, 321, 327. 
Bessel, F. W., 39, 206, 224, 264. 
Bickley, W. G., 164, 382. 
Biermann, 0., 372. 

Birge, R. T., 291. 

Bockwinkel, H. B. A., 369. 

Borel, E., 119. 

Bravais, A., 323. 

Briggs, H., 2, 63. 

Brodetsky, S., 106, 128. 

Broggi, U., 216. 

Bronwin, B., 169. 

Brouncker, W., 368. 

Brown, £. H., 373. 

Bruns, H., 172. 

Burgess, J., 181. 

Bum, J., 373. 

Burnside, W., 376. 

Buys-Ballot, 0. H. D., 361. 

Camp, K., 63. 

Campbell, L., 362. 

Cardan, G., 124. 

. Carse, G. A., 360. 

Cassina, U., 163. 


Catalan, E. C., 369. 

Cauchy, A. L., 26, 86, 169, 282, 380. 
Cavalieri, B., 166. 

Charlier, C. V. L., 172. 

Chebyshef, P. L., 168. 

Chi6, F.. 71. 

Chrystal, G., 9. 

Clairaut, A. C., 263. 

Clasen, B., 236. 

Collatz, L., 131. 

Collins, J., 12, 79, 144. 

Comrie, L. J., 41, 63, 60. 

Condon, E., 291. 

Cotes, R., 164. 

Craig, J. I., 361. 

Dale, J. B., 360. 

Dandelin, G. P., 94, .106, 

Daniell, P. J., 163. 

Danielson, G. C., 284. 

Darbishire, A. D., 330. 

Dary, M., 79. 

Davis, W, B., 61. 

De Moivre, A., 176. 

De Morgan, A., 18, 20, 86, 106, 120^ 
130, 149. 

Descartes, R., 123. 

Dodd, E. L., 218, 362. 

Dodgson, W., 361. 

Douglass, A. E., 362. 

Edgeworth, F. Y., 172, 269. 

Elderton, W. P., 339, 373. 

Emde, F., 63. 

Encke, J. F., 107, 234. 

Erlang, A. K., 208. 

Euler, L., 98, 112, 136. 

Everett, J. D., 40, 46. 

Faber, G., 86. 

Falkner, V. M., 363. 

Fechner, G. T., 217. 

Fermat, P., 317. 

Fisher, R. A., 199. 

Forsyth, C. H., 69. 



390 THE CALCULUS OF OBSERVATIONS 


Fourier, J. B., 123, 169, 260, 263. 
Fnser, D. 0., 12, 46,49, 50. 

Frewin, O. L., 375. 

Oalton, F., 167, 184. 

Gauss, K. F., 37, 169, 160, 176, 183, 
185, 199, 201, 203, 215, 224, 227, 
228, 234, 241, 242, 243, 247, 255, 
257, 283, 300, 374. 

, Gerling, C. L., 257. 

GUisher, J. W. L., 144, 262. 
Goedseels, P. J. £., 259. 

GraefTe, G. H., 106. 

Gram,J.P., 172. 

Gregory, J., 2,12, 79,144,154,156. 
GruW, O., 167. 

Hardy, G. F., 151,193,286,288. 
Hartree, D. R., 363. 

Hausdorff, F., 177, 204. 

Havelocli, T. H., 378. 

Hermite, 0., 174^ 283. ‘ 

Higham, J. A., 289. 

Hopfner, F., 362. 

Homer, W. G., 100. 

Hulme, H. B., 208. 

Hutton, 0., 100. 

Ingram, A., 100. 

J{^kson, D„ 197. 

Jacobi, C, G. J., 140, 251, 267. 

James, H. M., 382. 

^ Joffe, S. A., 36. 

Kapteyn, J. G., 329. 

King, A. £., 151. 

King, G., 57, 59, 303. 

Knott, G. G., 376. 

Kiiloff, A., 363. 

Kryloff, N., 382. 

Kummer, E. £., 369. 

Kutta, M. W., 367. 

Lagrange, J. L., 29, 53, 98, 263, 2S2, 
360, 373. 

Lambert, J. H., 373. 

JjanczoB, G., 284, 316. 

Laplaoe, P. S., 40, 167, 168, 172, 184, 
224, 227, 259, 300. 

Legendre, A. M., 210, 259. 

Leibnitz, G. W. von, 154. 

Lidstone, G. J., 41, 49, 59,193, 288. 
Lin, S. N., 131. 

LindelOf, E., 367. 

Lippmann, G., 179. 

Lobachevi&y,'N. I., 106. 

Lubbock, J. W., 149. 


Maclaurin, G., 135, 155. 

Markoff, A. A., 369. 

Martin, Q., 280. 

Mayer, T., 259. ' 

MeideU, B., 218. 

Milne, W. E., 367. 

Mises, R. V., 363. 

Mouton, G., 68. 

Narumi, S., 371. 

Newton, 1., 12, 15, 20, 25, 37, 39, 
79, 85, 124, 164. 

Nicoletti, 0., 81. 

Nielsen, H. P., 144. 

Nystrdm, £. J., 381. 

d’Ocagne, M., 128, 248. 

Oppei^eim, S., 362. 

Oppermann, L. H. F., 20. 

Ostrowski, A., 131. 

Peano, G., 32, 156. 

Pearl, R., 328. 

Pearson, K., 316, 328, 336, 338, 342 
371, 377. 

Peters, G. A. F., 206. 

Pizzetti, P., 215. 

Poincar6, J. H., 179, 218. 

Poisson, 8. D., 168. 

Pdlya, G., 119. 

Prasad, Gorakh, 367, 381. 

Prony, A. L., 369, 378. 

Raabe, J. L., 141. 

Radau, B., 161,162. 

Raphson, J., 85. 

Rayleigh, J. W. Strutt, Lord, 208,381. 
Rhodes, E. G., 316. 

Rietz, H. L., 334. 

Rigaud, S. P., 2, 12, 79,144. 

Ritz, W., 381. 

Ross, R., 81. 

RufGini, P., 100. 

Runge, G., 271, 367, 

Sadowski, M., 375. 

Schiaparelli, G., 216. 

Schimmack, R., 216. 

Schott, G. A., 267. 

Schulz, G., 363. 

Schumacher, H. G., 228. 

Schuster, A., 346, 361. 

Seeliger. H. H., 217. 

Seidel, P. L., 218, 266. 

Selling, F., 371. 

Sharp, H. S., 131. 

Shea, J. D., 291. 

Shearer, G., 360. 



INDEX OF NABIES 




Sheppard, W. F., 36, 49, 161, 181, 
104, 196, 291, 300, 376. 

Sherriff, C. W. M., 291, 297. 
Shovelton, S. T., 161,162. 

Simpson, T., 166, 166, 376. 

Smeal, G., 106. 

Smith, J., 12. 

Spencer, J., 290, 373. 

Sprague, T. B., 149, 286, 303. 
Steffensen, J. F., 34, 156, 367. 

Sterne, T. E., 362. 

StewaH, B., 361. 

Stirling, J., 39, 61, 138, 140, 368, 
369. 

Stokes, G. G., 361. 

StOrmer, C., 363. 

Strachey, R., 361. 

Stnmpfif, R., 362. 

Sturm, J. K. F., 123. 

Temple, G., 382. 

Thiele, T. N., 171, 172, 374. 
Thompson, S. P., 271. 

Todhunter, R., 41. 

Tricomi, F., 381. 


Turner, H. H., 362. 

Uven, M. J. van, 321. 

Vall4e-Pous8in, 0. J. de la, 259. 

Vieta, F., 79. 

Volterra, V., 378. 

Walker, G. T., 362. 

Wallis, J., 85, 86, 130. 

Waring, E., 29,112. 

Warner, W., 79. 

Watson, G. N., 119,169,169,174,179, 
181,182. 

Weber, W. E., 217. 

Weddle, T., 151. 

Weldon, W. F. R., 333. 

Whewell, W., 86, 286. 

Whittaker. E. T., 119, 120, 124, 128, 
169, 169, 174, 179, 181, 182, 280, 

OAO 077 07Q 

Woolhouse* W. S. B., 168, 286, 370. 
Wronski, H., 121. 

Yule, G. U., 334. 




GENERAL INDEX 


Abel’s integral equation, 376 
Adams' method of solving differential 
equations, 863 
Algcbraio equations, 78 
Alternants, 380 
Argument, 1 

Arithmetic mean of absolute devia¬ 
tions, 184 

„ „ iH>stulatc of, 215, 

217, 321 
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Asymptotic expansion of error function, 
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Axioms for XK)8tulate of arithmetic 
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Barycentre, symmetric, 249 
Bernoullian numbers, 134 
Bernoulli’s method of solving equa¬ 
tions, 98 

,, M of solving equa¬ 

tions compared 
with root-squaring 
method, 112 
„ polynomials, 141 
Bertrand’s proof of normal law for two 
variables, 321 
Bessel function, 119 
Bessel’s formulae for probable error of 
aritlimetic mean, 206 
,, interpolation formula, 39, 47 
Binomial Theorem, 15 
Bronwiu’s formula for integi-ation, 159 
Brouncker’s series, 368 
Brownian motion, 208 
Burnside’s formula for double integrals, 
375 


Cauchy’s formula for trigonometric 
interpolation, 282 


Central differences, 35 
Central-difference formula for numerical 
integration, 146 

Chebyshefs formulae for integration, 
158 

Coefficient of correlation, 329, 335 
Coefficients, Fourier, 263 

„ ,, probable error of, 

280 

Coin, frequency distribution of tosses 
of, 176 

Coincident roots of an equation, 117 
Complex roots of an equation, 113 
Compound deviation, modulus of pre¬ 
cision of, 175 

Condition, equations of, 209 
Conjunctive probability, 817 
Conservation, theorems of, 307 
Constituent tides, 843 
Contingency methods, 388 
Correlation, 317-342 

„ as a test for periodicity, 

345 

,, coefficient, 329, 335 

,, ratio, 336 

Cotes’ assumption, 322 
Cubic, numerical solution of, 124 


Daiidelin’s method of solving equa¬ 
tions, 106 

Data, smoothing of, 285 
Derivatives of a function, 62 

„ of a function in terms of 
central differences, 64 
„ of a function in terms of 
divided differences, 65 
Determinants, 71 
Deviation, 168 
Difference tables, 2 
Differences, for functions of two argu¬ 
ments, 371 

„ in terms of derivatives, 364 
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Differences of data, prol^ble value of, 
297 

„ central, 85 

„ divided, 20, 96, 104, 871 

„ loading, 8 

Differentiation, generalised, 876 
Direct measurements of a single quan¬ 
tity, 220 

Divid^ differences, 20, 96, 104 

,9 „ for functions of two 

arguments, 871 
,9 „ with repcatedargu- 

ments, 27 

Double integrals, computation of, 874 


Edgewoith’s method, 269 
Encke roots, 107 
Equal coefficients, method of, 236 
Equations, algebraic and transcen¬ 
dental, 78-181 

„ linear, 75, 281, 284, 236, 

239 

„ of condition, 209 

„ „ reduction to 

linear form, 214 
„ simultaneous, 88, 90 

Error Function, 179 
,, mean, 188, 184 
„ Inean, of a determination, 243 
,, of mean square, 183 
„ probable, 184 
„ ,, of Fourier coefficients, 

280 

Errol'S, 210 

,, of observation, effect on fre¬ 
quency curves, 206 
Euler-Maclaurin formula, 134, 140 
Eyerett*s formula, 40, 46 
Exponentials, interpolation by, 369 
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„ > Stirling's approximation to, 
188 
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Fechner's psychophysical law, 217 
Fidelity to data, expression of, 305 
Fitting inverse powers,. 248 

„ normal frequency curves, 186- 
196 

Forcing of numbers, 187 
Fourier analysis, 260-284 
Fredholm's integral equation, 881 
Frequency distribution, 164 

„ „ involving cor¬ 

relation, 819 


Frequency distribution, not normal, 
178, 884 

,, function, approximation to, 
171 


,, general law of, 172, 824 

Function, 1 

,, given as series, roots of, 118 

,, interpolatory, 20 


Gal ton’s quincunx, 167 
Gauss* formulae of interpolation, 36, 46 
,, f, ,, ,, back- 

waid, 37 

„ „ for integration, 159 

,, ,, for interpolation (tri¬ 

gonometric), 288 

„ method of solution of normal 
equations, 234. 

„ postulate of the arithmetic 

mean, 215, 217 

„ Theoria Comhinaiimis proof 

of the Method of the Least 

Squares, 224 

,, Theoria Motua proof of the 

normal law, 218 

Gauss and Seidel's solving process, 255 
Genre of entire functions, 119 
Graduation, 285 

„ Aitken's development of 
Whittaker’s method, 
308 

M formula, Higham’s, 289 
,, ,, Rhodes’s, 316 

„ „ Sheppard’s, 291,300 

„ „ Si)encer’s, 290 

„ „ Whittaker’s, 803 

„ ,, Woolhouse’s, 286 

Graeffe's method of solving equations, 
106 

Gregory's formula of integration, 143 
Gregory-Newton formula, 10 

,, ,, generalised 

to two arguments, 
873 


Hardy's formula for integration, 151 
Hermite’s formula for trigonometric 
interpolation, 288 
„ functions, 174 

Hill^rt’s integral equation, 381 
Homer's method, 100 

„ ,, and divided differ¬ 

ences, 104 

Incomidete data for normal curve, 196 
„ Gamma Function, 877 
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Indirect observations, Gauss' Thcaria 
Morns discussion, 223 
Infinite series, roots of, 118 
Integral equations, 376 
Integrals, double, computation of, 374 
Integration, numerical, 132 
Interlaced parabolas, graduation for¬ 
mula, 300 

Interpolation by exponentials, 369 
,, by sine series, 263 

,, inverae, 60 

,, trigonometric, 263 

,, trigonometric, for un¬ 

equal intervals, 282 
interpolation formula: 
for functions of two arguments, 371 
for quinquennial sums, 57 
Gregory-Newton, 10, 26 

„ backward, 365 

Lagrange's, 28 

„ remainder term in, 32 

Laplace-Everett, 40, 46 
Newton, for unequal intervals, 24 
Newton-Bessel, 39, 47 
Newton‘Gauss, 36, 46 

„ backward, 37 
Newton-Stirling, 38, 43, 46 
Interpolatory functions, 20 
Inverse interpolation, 60 
Iteration, 79 

Jacobi’s theorem in Least Squares, 251 

King's formula for quinquennial sums, 
57 

Lagrange’s interpolation formula, 28 
„ „ „ remain. 

der term in, 32 

Laplace-Everett formula, 40, 46 
Laplace’s hypothesis on frequency 
distributions, 167 
„ proof of Method of Least 
Squares, 224 

Least Squares, Method of, 209-259 
Legendre polynomial, 74 [209 

Legendre’s principle of Least Squares, 
Linear equations, solution of, 75, 231, 
234, 236, 239 

,, function of deviations, prob¬ 
ability of, 168 

,, function of unknowns, weight 
of, 242 

Lobachevsky’s method of solving 
equations, 106 


Lozenge diagram, 43 

Lubb^k’s formula for summation, 14'9 

Mayer’s method, 258 
Mean absolute deviation, 184 
,, contingency, 338 
,, error, 183, 184 
„ „ of a determination, 243 

,, of observations, weight of, 222 
Means^ allowance for, in Fourier 
analysis, 279 

„ connected with normal dis¬ 
tributions, 182 
Median, 188 

,, probable error of, 197 
Method of equal coefTicients, 236 
,, of Least Squares, 209-259 
,, „ ,, alternatives 

to, 258 

Minimum approximation method, 259 
Modulus, 80 

„ of precision, 175, 179 

„ ,, „ accuracy of deter¬ 

mination, 199 

Moments, computation by summation, 
191 

Multiple normal correlation, 340 

Newton’s formula for unequal intervals, 
24 

„ ,, for unequal intervals, 

generalised to tw'o 
arguments, 371 

Newton-Bessel interpolation formula, 
39, 47 

Newton-Cotes formulae of integration, 
152 

Newton-Gauss interpolation formula, 
36, 46 

Newton-Gauss interpolation formula, 
backward, 37 

Newton-Gregory interpolation formula, 
10, 26 

Newton-Gregory interpolation formula, 
backward, 365 

Newton-Raphson method, 84, 90 
Newton-Stirling interpolation formula, 
38, 43, 46 
Noniograjdiy, 128 

Normal curve with incomplete data, 196 
„ equations, 211, 224 

,, ,, solution of, 231, 

234, 236, 239 

„ frequency distributions, 164- 
208 
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Normal law, Gauss’ Theoria Motus 
proof of, 218 

„ „ non-uuiyersality of, 177 

„ „ of frequency for two 

vaiiables, 321 

„ „ of frequency for two 

variables, determina¬ 
tion of constants, 824 
Nucleus of an integral equation, 876 

Operators, symbolic, 4 

Parabolic rule for integration, 156 
Parabolic-cylinder functions, 174 
Parameter connected with a normal 
distribution, 183, 186 
Period, adjustment to, in a Fourier 
analysis, 278 

Periodicities, search for, 343-362 
Periodicity, correlation as a test for, 
845 

Periodogram, 346 

„ equation of, 348 

,, Schuster’s, 361 

Peters’ fonnulae for probable error of 
arithmetic mean, 206 
Pivotal element, 71 
Poisson’s integral equations, 377 
Polynomial, differences of, 5 
Polynomials of Bernoulli, 141 
Position, Rule of False, 92 
Postulate of the arithmetic mean, 215, 
217 

Powers, sums of, 137, 294 
Precision, measure of, 245 

,, modulus of, 175, 179 
Probability, conjunctive, 317 
Probable error, 184 

,, „ determination of, from 

residuals, 204 

,, }) of the arithmetic mean, 

196, 206 

,, M of Fourier coefficients, 

280 

„ „ of the median, 197 

„ »» of standard deviation 

and modulus of pre¬ 
cision, 201 

„ value of constant differences 
of data, 297 

Prony’s method of interpolation by 
exponentials, 369 

Quailratic, nomogram for, 128 
„ form, reduction of, 235 

„ mean deviation, 183. 


Quadratic mean error of a determina 
tion, 245 
Quartile, 184 

„ probable error of, 203 
Quincunx, Galtop’s, 167 

Repeated roots of an equation, 117 
Reproductive property of normal law, 
175 

Residuals, 234 

,, determination of prol^ble 
error from, 204 
„ sum of squares, 246 
Root-mean-square contingency, 338 
Root - squaring method of solving 
equations, 106 

Root - squaring method of solving 
equations, compared with Bernoulli’s 
method, 112 
Roots, square, 79 
Ruffini-Horner method, 100 

„ ,, ,, and divided 

differences, 104 
Rule of False Position, 92 

Schuster’s periodogram, 346, 361 
Secular change, in Fourier analysis, 
279 

Seidel’s solving process, 255 
Semin variants, 171 

Series, formula for root of an equation, 
120 

„ Fourier, 263 
,, roots of, 118 

,, summation of slowly-conver* 
gent, 368 

Sheppard’s corrections, 194 

„ graduation formulae, 291 
Shovelton’s formula for integration, 
151 

Simpson’s formula, 156, 157* 

,, „ double, 375 

Simultaneous equations, 88, 90 
Skew frequency distributions, 173 
Slowly-convergent series, summation 
of, 368 

Smoothing of data, 285 
Smoothness, expression of, 305 
Solving process of Gauss and Seidel, 
255 

Spencer’s graduation formula, 290 
Square roots, 79 
Standard deviation, 183 

^ „ accuracy of deter* 

mination of, 19 
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Standai-d deviation for attributes taken 
singly, 327 

,, „ of sum, etc., of two 

measures, 328 

Stirling’s approximation to the fac¬ 
torial, 138 

,, interpolation formula, 38,43, 

46 

,, method for slowly-convcr- 

gent series 368 
Subtabiilation, 53 
Sum of squares of residuals, 246 
Summation, 191 

„ by ?i’s, 287, 288 

„ by parts, 52 

„ formulae of graduation, 288 
,, numerical, 132, 136, 149 

„ of slowly-convergeiit series, 

368 

Symmetric barycentre, 249 

Thcoria Motiis^ Gauss’, 215, 224 
Three - eighths rule for integration, 
157 

Transcendental equations, 78 
Transformations preliminary to inter¬ 
polation, 50 

Trapezoidal rule for integration, 156 
Trigonometric interpolation, 263 
Trisection method of solving cubic, 
124 


Twelve-ordinate scheme for Fourier 
analysis, 267 

Twenty-four ordinate scheme for 
Fourier analysis, 273 

Unequal intervals, trigonometric inter¬ 
polation for, 282 

Unknowns connected by rigorous equa 
tions, 252. 

J Weber-Fechner psychophysical law, 217 
AVeddle’s formula for integration, 151 
Weight of a linear function, 222, 226 
„ of a linear function of the un¬ 
knowns, 242 

„ of an observation, 220, 221, 
225, 228 

„ of the unknowns, 239 
Whittaker’s formula for root of a series 
120 

,, graduation method, 303 
,, ' nomogram for quadratic, 

128 

„ solutions of integral equa¬ 

tions, 377 

Woolhouse’s formula for integration. 
158 

,, formula of graduation. 

286, 289 


Zero, ditferences of. 6 






